OM protein - protein search, using sw model



Run on:         September 28, 2004, 09:07:19 ; Search time 126 Seconds
                (without alignments)
                953.037 Million cell updates/sec

Title:          US-10-070-532-2
Perfect score:
Sequence:       1 MEPSATPGAQMGVPPGSREP...CSVSKISEHWLTSVTTVLP 425

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       1586107 seqs, 282547505 residues

Total number of hits satisfying chosen parameters:      1586107

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
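
For orientation, the scores in this report come from a Smith-Waterman ("sw") local alignment with affine gap penalties (Gapop 10.0, Gapext 0.5). The following is a minimal sketch of that recurrence, not the search program's own code. It assumes a gap of length k costs Gapop + (k - 1) * Gapext (the report does not say which convention is used) and substitutes a toy identity scorer for BLOSUM62. The names `toy_score` and `sw_affine_score` are illustrative only.

```python
def toy_score(x, y):
    """Stand-in substitution scorer (assumption): +5 identity, -4 otherwise."""
    return 5.0 if x == y else -4.0


def sw_affine_score(a, b, score=toy_score, gap_open=10.0, gap_ext=0.5):
    """Best local alignment score of a and b with affine gap penalties."""
    neg = float("-inf")
    n, m = len(a), len(b)
    # H: best score of an alignment ending at (i, j);
    # E / F: best score ending at (i, j) with a gap in a / in b.
    H = [[0.0] * (m + 1) for _ in range(n + 1)]
    E = [[neg] * (m + 1) for _ in range(n + 1)]
    F = [[neg] * (m + 1) for _ in range(n + 1)]
    best = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Open a new gap (gap_open) or extend an existing one (gap_ext).
            E[i][j] = max(H[i][j - 1] - gap_open, E[i][j - 1] - gap_ext)
            F[i][j] = max(H[i - 1][j] - gap_open, F[i - 1][j] - gap_ext)
            diag = H[i - 1][j - 1] + score(a[i - 1], b[j - 1])
            # The floor at 0 is what makes the alignment local.
            H[i][j] = max(0.0, diag, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best
```

With the toy scorer, two identical 4-residue strings score 20.0, and a two-residue insertion costs 10.5 under the assumed convention.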

Database :      A_Geneseq_29Jan04:*
                1: geneseqp1980s:*
                2: geneseqp1990s:*
                3: geneseqp2000s:*
                4: geneseqp2001s:*
                5: geneseqp2002s:*
                6: geneseqp2003as:*
                7: geneseqp2003bs:*
                8: geneseqp2004s:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
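
As a rough illustration of what a "predicted by chance" count means, the sketch below computes the expected number of database sequences reaching a given score. It assumes chance scores follow a Gumbel (extreme-value) distribution. That model, the function name `expected_chance_hits`, and the parameters `mu` and `beta` are all assumptions made for this example; the report only states that Pred. No. is derived from the total score distribution.

```python
import math


def expected_chance_hits(score, n_seqs, mu, beta):
    """Expected number of sequences scoring >= score by chance.

    Assumes chance scores are Gumbel(mu, beta) distributed (hypothetical
    parameters that would have to be fitted to the observed scores).
    """
    z = (score - mu) / beta
    if z < -700.0:
        # Far below the bulk of the distribution: every sequence qualifies.
        return float(n_seqs)
    # Gumbel survival function: P(S >= s) = 1 - exp(-exp(-z)).
    p_tail = -math.expm1(-math.exp(-z))
    return n_seqs * p_tail
```

Under this model the expected count falls off roughly exponentially as the score rises, which is why near-identical hits carry vanishingly small values.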
ALIGNMENTS



RESULT 1 
AAU00438 

ID AAU00438 standard; protein; 425 AA. 
XX 

AC AAU00438; 
XX 

DT 17-MAY-2001 (first entry) 
XX 

DE Human neuropeptide receptor. 
XX 

KW Human; neuropeptide receptor; neuropeptide Y receptor; obesity; 

KW nervous system disorder; hyperproliferative disorder; diabetes mellitus;

KW cardiovascular disorder; autoimmune disorder; infectious disorder; 

KW eating behaviour disorder; narcolepsy; neurological disease; 

KW narcotics addiction; nicotine addiction; alcohol addiction; gene therapy; 

KW protein co-ordinate data. 



XX 

OS Homo sapiens. 
XX 

FH Key             Location/Qualifiers
FT Region          47..72
FT                 /label= TM1
FT                 /note= "Transmembrane region 1"
FT Region          83..106
FT                 /label= TM2
FT                 /note= "Transmembrane region 2"
FT Region          112..142
FT                 /label= TM3
FT                 /note= "Transmembrane region 3"
FT Region          163..189
FT                 /label= TM4
FT                 /note= "Transmembrane region 4"
FT Region          214..239
FT                 /label= TM5
FT                 /note= "Transmembrane region 5"
FT Region          299..327
FT                 /label= TM6
FT                 /note= "Transmembrane region 6"
FT Region          335..363
FT                 /label= TM7
FT                 /note= "Transmembrane region 7"

XX 

PN WO200117532-A1. 
XX 

PD 15-MAR-2001. 
XX 

PF 07-SEP-2000; 2000WO-US024518 . 
XX 

PR 10-SEP-1999; 99US-00393696 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Soppet DR, Li Y, Rosen CA; 
XX 

DR WPI; 2001-183276/18. 

DR N-PSDB; AAS00491. 
XX 

PT A new nucleic acid encoding a human neuropeptide receptor polypeptide, 

PT useful for preventing, treating or ameliorating obesity, narcolepsy, 

PT neurological disease and addiction to narcotics, nicotine and alcohol 
XX 

PS Claim 3; Fig 4; 385pp; English. 
XX 

CC The present sequence represents a novel human neuropeptide receptor which
CC shows sequence homology to the neuropeptide Y receptor. Two splice
CC variants of the neuropeptide receptor (AAU00439-AAU00440) and a possible
CC mutant (AAU00442) are also described. Polypeptides and polynucleotides of
CC the neuropeptide receptor are useful for diagnosing, preventing, or
CC treating a pathological condition in a subject related to the central
CC nervous and peripheral nervous systems (CNS and PNS). The polypeptides
CC and polynucleotides may be used to treat hyperproliferative,
CC cardiovascular, autoimmune, nervous system or infectious disorders e.g.
CC cancer, heart disease, rheumatoid arthritis, Alzheimer's disease, HIV
CC infection and diabetes mellitus. In particular they are useful for
CC preventing, treating or ameliorating a medical condition in a mammal such
CC as obesity/eating behaviour disorders, narcolepsy, neurological disease,
CC addiction to narcotics, nicotine and alcohol, chronic pain, acute pain,
CC migraine headaches and anxiety disorders. The polynucleotides encoding
CC the neuropeptide receptor can also be used in gene therapy methods for
CC treating such diseases
XX
SQ Sequence 425 AA;



Query Match 100.0%; Score 2219; DB 4; Length 425; 

Best Local Similarity 100.0%; Pred. No. 6.8e-221; 

Matches 425; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420

Qy        421 TTVLP 425
              |||||
Db        421 TTVLP 425





RESULT 2 
AAB67489 

ID AAB67489 standard; protein; 425 AA. 
XX 

AC AAB67489; 
XX 

DT 29-MAY-2001 (first entry) 
XX 

DE Amino acid sequence of a human hypocretin (orexin) receptor 1. 
XX 



KW Human; hypocretin receptor 1; orexin receptor 1; HCRTR1; chromosome 1;
KW 1p33; central nervous system modulator.
XX
OS Homo sapiens.
XX
PN WO200114555-A1.
XX
PD 01-MAR-2001.
XX
PF 22-AUG-2000; 2000WO-US022986.
XX
PR 23-AUG-1999; 99US-00379083.
PR 07-JAN-2000; 2000US-00479128.
XX
PA (DECO-) DECODE GENETICS EHF.
XX
PI Olafsdottir BR, Gulcher J;
XX
DR WPI; 2001-211306/21.
DR N-PSDB; AAF55159.
XX
PT Novel isolated nucleic acid molecule encoding hypocretin (orexin)
PT receptor 1 useful for treating and diagnosing narcolepsy.
XX
PS Disclosure; Fig 1A; 44pp; English.
XX
CC The present sequence represents a human hypocretin (orexin) receptor 1
CC (HCRTR1) polypeptide. The HCRTR1 gene is present on chromosome 1,
CC location 1p33. It is likely that a mutation in the HCRTR1 gene is
CC associated with narcolepsy. HCRTR1 is a central nervous system modulator.
CC The HCRTR1 polypeptide and polynucleotide are useful for diagnosing or
CC treating narcolepsy in an individual. The HCRTR1 polynucleotide is a
CC source of probes and primers, and is also used to produce the protein
CC recombinantly
XX
SQ Sequence 425 AA;

Query Match 100.0%; Score 2218; DB 4; Length 425; 

Best Local Similarity 99.8%; Pred. No. 8.6e-221; 

Matches 424; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
              |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy        421 TTVLP 425
              |||||
Db        421 TTVLP 425


RESULT 3 
AAU11188 

ID AAU11188 standard; protein; 425 AA. 
XX 

AC AAU11188; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Human G protein-coupled receptor HFGAN72X variant. 
XX 

KW Human; G protein-coupled receptor; GPCR; HFGAN72X; bacterial infection;
KW fungal infection; protozoan infection; viral infection;
KW human immunodeficiency virus; HIV; cancer; diabetes; Parkinson's disease;
KW osteoporosis; myocardial infarction; ulcer; asthma; allergy;
KW angina pectoris; renal disease; depression; schizophrenia; anorexia;
KW obesity; Kallman's syndrome; hypothalamic disorder;
KW idiopathic hormone deficiency; gigantism; migraine; pain; lung disease;
KW burn; sleep disorder; jet lag; Huntington's disease; gene therapy.

XX 

OS Homo sapiens. 
XX 

PN US2001025031-A1. 
XX 

PD 27-SEP-2001. 
XX 

PF 06-APR-2001; 2001US-00828538.
XX
PR 08-JUN-1998; 98US-0088524P.
PR 22-JUL-1998; 98US-0093726P.
PR 08-JUN-1999; 99US-00328014.
XX 

PA (ELLI/) ELLIS C E. 

PA (KWOK/) KWOK C. 

PA (BODS/) BODSWORTH N J. 

PA (HALS/) HALSEY W. 

PA (HORN/) HORN S V. 

XX 

PI Ellis CE, Kwok C, Bodsworth NJ, Halsey W, Horn SV; 
XX 

DR WPI; 2001-624968/72. 



DR N-PSDB; AAS17464.
XX 



PT Isolated HFGAN72 receptor useful for treatment of a patient having need 

PT of HFGAN72 receptor and in the detection and treatment of disease, e.g. 

PT infections such as bacterial, fungal, protozoan and viral infections and 

PT cancers.
XX 

PS Claim 23; Fig 6; 75pp; English. 
XX 

CC The invention relates to an isolated polypeptide, the HFGAN72 receptor or
CC its variant, encoded by the 8 exon sequences given in the specification.
CC HFGAN72 is a G protein-coupled receptor (GPCR). HFGAN72 is useful for the
CC treatment of a patient having need of HFGAN72 receptor where HFGAN72 is
CC administered by providing to the patient DNA encoding HFGAN72 and
CC expressing HFGAN72 in vivo (i.e. by gene therapy). HFGAN72 is particularly
CC useful for applications in the detection and treatment of disease, e.g.
CC infections such as bacterial, fungal, protozoan and viral infections,
CC particularly infections caused by human immunodeficiency virus (HIV)-1 or
CC HIV-2, cancers, diabetes, Parkinson's disease, osteoporosis, myocardial
CC infarction, ulcers, asthma, allergies, angina pectoris, renal disease,
CC depression, schizophrenia, anorexia, obesity, Kallman's syndrome,
CC hypothalamic disorders, idiopathic hormone deficiency (e.g. gigantism),
CC migraine, pain, lung diseases, burns, sleep disorders, jet lag,
CC Huntington's disease and many other diseases and disorders given in the
CC specification. The present sequence is the human HFGAN72X variant,
CC encoded by an alternative allele of HFGAN72
XX 

SQ Sequence 425 AA; 



Query Match 99.8%; Score 2215; DB 4; Length 425; 

Best Local Similarity 99.8%; Pred. No. 1.8e-220; 

Matches 424; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
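
The two percentages above can be checked by hand. The sketch below assumes that Best Local Similarity is identical matches divided by aligned length, and that Query Match is the hit score divided by the perfect score. It also assumes a perfect score of 2219, taken from the fully identical hit in RESULT 1, because the report header leaves that field blank. The helper name `percent` is illustrative.

```python
def percent(numerator, denominator):
    """Percentage rounded to one decimal place, as printed in the report."""
    return round(100.0 * numerator / denominator, 1)


# Assumption: the perfect score equals the score of the 100% identical hit.
ASSUMED_PERFECT_SCORE = 2219

best_local_similarity = percent(424, 425)            # Matches / aligned length
query_match = percent(2215, ASSUMED_PERFECT_SCORE)   # Score / perfect score

print(best_local_similarity, query_match)
```

Under these assumptions both values come out at 99.8, in line with the figures printed for this result.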



Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
              ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420

Qy        421 TTVLP 425
              |||||
Db        421 TTVLP 425



RESULT 4
AAW80456
ID AAW80456 standard; protein; 425 AA.
XX
AC AAW80456;
XX
DT 26-JAN-1999 (first entry)
XX
DE G-protein coupled receptor (HFGAN72X) polypeptide.
XX
KW G-protein coupled receptor; HFGAN72X; HIV infection; anorexia; cancer;
KW bulimia; asthma; Parkinson's disease; acute heart failure;
KW urinary retention; osteoporosis; angina pectoris; myocardial infarction;
KW benign prostatic hypertrophy; neurological disorder.
XX
OS Homo sapiens.
XX
PN EP875566-A2.
XX
PD 04-NOV-1998.
XX
PF 27-OCT-1997; 97EP-00308563.
XX
PR 30-APR-1997; 97US-00846704.
XX
PA (SMIK ) SMITHKLINE BEECHAM CORP.
XX
PI Bergsma DJ, Ellis CE;
XX
DR WPI; 1998-559432/48.
DR N-PSDB; AAV63468.
XX
PT New human G-protein coupled receptor HFGAN72X polypeptide and
PT polynucleotide - useful as diagnostic reagents and for treating e.g. HIV
PT infection, cancer and Parkinson's disease.
XX
PS Claim 1; Page 7-8; 24pp; English.
XX
CC The present sequence represents a G-protein coupled receptor (HFGAN72X)
CC polypeptide. HFGAN72X polypeptides and polynucleotides are useful for
CC diagnosing diseases related to over or under expression of HFGAN72X
CC proteins by identifying mutations in the HFGAN72X gene using HFGAN72X
CC probes, or determining HFGAN72X protein or mRNA expression levels.
CC HFGAN72X polypeptides are also useful for screening for compounds which
CC affect activity of the protein. Diseases that can be treated with
CC HFGAN72X include HIV infections, pain, anorexia, cancers, bulimia,
CC asthma, Parkinson's disease, acute heart failure, hypotension,
CC hypertension, urinary retention, osteoporosis, angina pectoris,
CC myocardial infarction, ulcers, allergies, benign prostatic hypertrophy,
CC and psychotic and neurological disorders
XX
SQ Sequence 425 AA;

Query Match 99.8%; Score 2214; DB 2; Length 425; 

Best Local Similarity 99.5%; Pred. No. 2.2e-220; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
              ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
              |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy        421 TTVLP 425
              |||||
Db        421 TTVLP 425



RESULT 5 
AAU11186 

ID AAU11186 standard; protein; 425 AA. 
XX 

AC AAU11186; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Human G protein-coupled receptor HFGAN72X.
XX
KW Human; G protein-coupled receptor; GPCR; HFGAN72X; bacterial infection;
KW fungal infection; protozoan infection; viral infection;
KW human immunodeficiency virus; HIV; cancer; diabetes; Parkinson's disease;
KW osteoporosis; myocardial infarction; ulcer; asthma; allergy;
KW angina pectoris; renal disease; depression; schizophrenia; anorexia;
KW obesity; Kallman's syndrome; hypothalamic disorder;
KW idiopathic hormone deficiency; gigantism; migraine; pain; lung disease;
KW burn; sleep disorder; jet lag; Huntington's disease; gene therapy.

XX 

OS Homo sapiens. 
XX 

PN US2001025031-A1. 
XX 

PD 27-SEP-2001. 
XX 

PF 06-APR-2001; 2001US-00828538.
XX 

PR 08-JUN-1998; 98US-0088524P . 

PR 22-JUL-1998; 98US-0093726P . 

PR 08-JUN-1999; 99US-00328014 . 
XX 

PA (ELLI/) ELLIS C E. 

PA (KWOK/) KWOK C. 

PA (BODS/) BODSWORTH N J. 

PA (HALS/) HALSEY W. 

PA (HORN/) HORN S V. 

XX 

PI Ellis CE, Kwok C, Bodsworth NJ, Halsey W, Horn SV; 
XX 

DR WPI; 2001-624968/72. 
XX 

PT Isolated HFGAN72 receptor useful for treatment of a patient having need 

PT of HFGAN72 receptor and in the detection and treatment of disease, e.g. 

PT infections such as bacterial, fungal, protozoan and viral infections and 

PT cancers.
XX 

PS Claim 8; Fig 2; 75pp; English. 
XX 

CC The invention relates to an isolated polypeptide, the HFGAN72 receptor or
CC its variant, encoded by the 8 exon sequences given in the specification.
CC HFGAN72 is a G protein-coupled receptor (GPCR). HFGAN72 is useful for the
CC treatment of a patient having need of HFGAN72 receptor where HFGAN72 is
CC administered by providing to the patient DNA encoding HFGAN72 and
CC expressing HFGAN72 in vivo (i.e. by gene therapy). HFGAN72 is particularly
CC useful for applications in the detection and treatment of disease, e.g.
CC infections such as bacterial, fungal, protozoan and viral infections,
CC particularly infections caused by human immunodeficiency virus (HIV)-1 or
CC HIV-2, cancers, diabetes, Parkinson's disease, osteoporosis, myocardial
CC infarction, ulcers, asthma, allergies, angina pectoris, renal disease,
CC depression, schizophrenia, anorexia, obesity, Kallman's syndrome,
CC hypothalamic disorders, idiopathic hormone deficiency (e.g. gigantism),
CC migraine, pain, lung diseases, burns, sleep disorders, jet lag,
CC Huntington's disease and many other diseases and disorders given in the
CC specification. The present sequence is the human HFGAN72X receptor being
CC the product of a splice variant of HFGAN72
XX 

SQ Sequence 425 AA; 
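
Each database record in this report uses two-letter line codes (ID, AC, DT, DE, KW, and so on) with XX as a separator. The sketch below shows one way such a record could be collected into a dictionary. The function name `parse_record` is hypothetical, and the sketch assumes every data line starts with its two-letter code followed by a space.

```python
from collections import defaultdict


def parse_record(text):
    """Group the lines of one flat-file record by their two-letter code."""
    fields = defaultdict(list)
    for line in text.splitlines():
        line = line.rstrip()
        # Skip blank lines and the XX separator lines.
        if len(line) < 2 or line.startswith("XX"):
            continue
        code, _, value = line.partition(" ")
        # Keep only lines that begin with an upper-case two-letter code.
        if len(code) == 2 and code.isalpha() and code.isupper():
            fields[code].append(value.strip())
    # Continuation lines with the same code are joined with a space.
    return {code: " ".join(values) for code, values in fields.items()}


sample = (
    "ID AAU11186 standard; protein; 425 AA.\n"
    "XX\n"
    "AC AAU11186;\n"
    "XX\n"
    "KW obesity; migraine;\n"
    "KW pain.\n"
)
record = parse_record(sample)
print(record["AC"], "|", record["KW"])
```

Lines that lost their code during extraction would be silently dropped by this sketch, so it is only useful on records whose prefixes are intact.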



Query Match 99.8%; Score 2214; DB 4; Length 425; 

Best Local Similarity 99.5%; Pred. No. 2.2e-220; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
              ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
              |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy        421 TTVLP 425
              |||||
Db        421 TTVLP 425



RESULT 6 
AAB67079 

ID AAB67079 standard; protein; 425 AA. 
XX 

AC AAB67079;
XX 

DT 10-APR-2001 (first entry) 
XX 

DE Human HFGAN72 receptor SEQ ID NO: 13. 

XX 

KW Human; mouse; rat; Lig72A; Lig72B; neuropeptide receptor; HFGAN72;
KW truncation mutant; ligand; neurodegenerative disorder; pain;
KW eating disorder; behaviour disorder; mood disorder.
XX
OS Homo sapiens.
XX
PN WO200100787-A2.
XX
PD 04-JAN-2001.
XX
PF 22-JUN-2000; 2000WO-US017251.
XX
PR 25-JUN-1999; 99US-0141156P.
XX 






PA (SMIK ) SMITHKLINE BEECHAM CORP. 
PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Bingham S, Darker J, Liu W, Martin JD, Parsons AA, Patel SR; 
XX 

DR WPI; 2001-071483/08. 
XX 

PT Polynucleotides encoding Lig 72A polypeptides or their variants, which 

PT are useful in the treatment of a disease or disorder associated with 

PT pain, e.g. enhanced or exaggerated sensitivity to pain, hyperalgesia, 

PT neuropathic pain and back pain. 
XX 

PS Claim 8; Fig 7; 101pp; English.
XX 

CC The present invention provides the protein and coding sequences for the 

CC human, mouse and rat HFGAN receptor ligand Lig72A. It also provides 

CC truncated mutant versions. These, and their agonists and antagonists, are 

CC all useful in the treatment of eating, neurodegenerative, behaviour, 

CC mood, sexual, hormonal and sleep disorders, pain, depression, epilepsy

CC and acute inflammatory conditions 
XX 

SQ Sequence 425 AA; 



Query Match 99.8%; Score 2214; DB 4; Length 425; 

Best Local Similarity 99.5%; Pred. No. 2.2e-220; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
              ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
              |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy        421 TTVLP 425
              |||||
Db        421 TTVLP 425



RESULT 7 
AAG78345 

ID AAG78345 standard; protein; 425 AA. 
XX 

AC AAG78345;
XX 

DT 22-JAN-2002 (first entry) 
XX 

DE Human HFGAN72X G coupled receptor polypeptide. 
XX 

KW Antibacterial; fungicide; virucide; protozoacide; anti-HIV; analgesic;
KW cytostatic; nootropic; antiparkinsonian; cardiant; antiulcer;
KW antiasthmatic; tranquiliser; neuroleptic; antidepressant; anticonvulsant;
KW osteopathic; HIV infection; pain; cancer; anorexia; bulimia;
KW Parkinson's disease; acute heart failure; hypotension; hypertension;
KW urinary retention; osteoporosis; angina pectoris; myocardial infarction;
KW ulcers; asthma; allergy; delirium; dementia;
KW benign prostatic hypertrophy; anxiety; schizophrenia; manic depression;
KW dyskinesia; G coupled receptor; HFGAN72X; 7 transmembrane receptor.
XX
OS Homo sapiens.
XX
PN EP1154019-A2.
XX
PD 14-NOV-2001.
XX
PF 27-OCT-1997; 2001EP-00203008.
XX 

PR 30-APR-1997; 97US-00846704 . 

PR 27-OCT-1997; 97EP-00308563 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis CE; 
XX 

DR WPI; 2002-012659/02. 

DR N-PSDB; AAI64172. 
XX 

PT Nucleic acid encoding the HFGAN72X receptor, useful for diagnosis and
PT treatment of e.g. infections, cancer, anorexia, bulimia, Parkinson's
PT disease, and acute heart failure.
XX 

PS Claim 11; Page 7-8; 24pp; English. 
XX 

CC The present sequence is that of a human HFGAN72X polypeptide encoded by a 

CC cDNA shown in AAI64172. The specification describes a newly isolated 

CC polynucleotide encoding a HFGAN72X G coupled receptor polypeptide. The 

CC protein of the invention has antibacterial, fungicide, virucide, 

CC protozoacide, anti-HIV, cardiant, analgesic, cytostatic, nootropic, 

CC antiparkinsonian, antiulcer, antiasthmatic, tranquiliser, neuroleptic, 

CC antidepressant, anticonvulsant and osteopathic activities. HFGAN72X 

CC polynucleotides (PNs) are used to express HFGAN72X in vivo, to treat 

CC diseases requiring increased activity or expression of HFGAN72X; for 

CC recombinant production of HFGAN72X; diagnose diseases by detecting 



CC mutations in genomic sequences and in chromosome identification and 

CC mapping. HFGAN72X polypeptides are used to raise specific antibodies; as 

CC therapeutic agents; to identify HFGAN72X protein-expressing clones; to 

CC purify HFGAN72X proteins; in vaccines. Cells transformed with HFGAN72X 

CC PNs are used to identify (ant)agonists of HFGAN72X, useful
CC therapeutically. Nucleic acids that inhibit expression of HFGAN72X and
CC polypeptides that compete with ligands for binding to HFGAN72X proteins
CC are also useful therapeutically and diagnostically. HFGAN72X-related

CC diseases include infections (bacterial, viral, fungal or protozoal, 

CC particularly HIV-1 or -2); pain; cancer; anorexia; bulimia; Parkinson's 

CC disease; acute heart failure; hypotension; hypertension; urinary 

CC retention; osteoporosis; angina pectoris; myocardial infarction; ulcers; 

CC asthma; allergy; benign prostatic hypertrophy; anxiety; schizophrenia; 

CC manic depression; delirium; dementia; severe mental retardation and 

CC dyskinesias 

XX 

SQ Sequence 425 AA; 



Query Match 99.8%; Score 2214; DB 5; Length 425; 

Best Local Similarity 99.5%; Pred. No. 2.2e-220; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
              ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
              |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy        421 TTVLP 425
              |||||
Db        421 TTVLP 425





RESULT 8 
ABP81941 



ID ABP81941 standard; protein; 425 AA.
XX
AC ABP81941;
XX
DT 04-MAR-2003 (first entry)
XX
DE Human orexin receptor 1 protein SEQ ID NO: 368.
XX
KW G protein-coupled receptor; GPCR; antigenic peptide; gene therapy;
KW G protein-coupled receptor modulator; antibody; immune-related disease;
KW growth-related disease; cell regeneration-related disease; AIDS; cancer;
KW immunological-related cell proliferative disease; autoimmune disease;
KW Alzheimer's disease; atherosclerosis; infection; osteoarthritis; allergy;
KW osteoporosis; cardiomyopathy; inflammation; Crohn's disease; diabetes;
KW graft versus host disease; Parkinson's disease; multiple sclerosis; pain;
KW psoriasis; anxiety; depression; schizophrenia; dementia; memory loss;
KW mental retardation; epilepsy; asthma; tuberculosis; obesity; nausea;
KW hypertension; hypotension; renal disorder; rheumatoid arthritis; trauma;
KW ulcer.
XX 

OS Homo sapiens. 
XX 

PN WO200261087-A2. 
XX 

PD 08-AUG-2002. 
XX 

PF 19-DEC-2001; 2001WO-US050107.
XX

PR 19-DEC-2000; 2000US-0257144P.
XX 

PA (LIFE-) LIFESPAN BIOSCIENCES INC. 
XX 

PI Burmer GC, Roush CL, Brown JP; 
XX 

DR WPI; 2003-046718/04. 
DR N-PSDB; ABZ42789. 
XX 

PT New isolated antigenic peptides e.g., for G protein-coupled receptors 

PT (GPCR), useful for diagnosing and designing drugs for treating conditions 

PT in which GPCRs are involved, e.g. AIDS, Alzheimer's disease, cancer or 

PT autoimmune diseases. 

XX 

PS Disclosure; Fig 1; 523pp; English. 
XX 

CC The present invention describes antigenic peptides (I) comprising: (a)
CC any one of 1601 sequences (see ABP82019 to ABP83619) of 12-24 amino
CC acids. Also described: (1) an assay for the detection of a particular G
CC protein-coupled receptor (GPCR) or a candidate polypeptide in a sample;
CC and (2) an isolated antibody having high specificity and high affinity or
CC avidity for a particular GPCR. (I) can be used as GPCR modulators and in
CC gene therapy. The antigenic peptides for GPCRs are useful in detecting an
CC antibody against a particular GPCR, and in the production of specific
CC antibodies. The peptides and antibodies are also useful for detecting the
CC presence or absence of corresponding GPCRs. The antigenic peptides for
CC GPCRs and antibodies are useful for diagnosing and designing drugs for
CC treating immune-related diseases, growth-related diseases, cell
CC regeneration-related disease, immunological-related cell proliferative
CC diseases, or autoimmune diseases, e.g. AIDS, Alzheimer's disease,
CC atherosclerosis, bacterial, fungal, protozoan or viral infections,
CC osteoarthritis, osteoporosis, cancer, cardiomyopathy, chronic and acute
CC inflammation, allergies, Crohn's disease, diabetes, graft versus host
CC disease, Parkinson's disease, multiple sclerosis, pain, psoriasis,
CC anxiety, depression, schizophrenia, dementia, mental retardation, memory
CC loss, epilepsy, asthma, tuberculosis, obesity, nausea, hypertension,
CC hypotension, renal disorders, rheumatoid arthritis, trauma, ulcers, or
CC any other disorder in which GPCRs are involved. The antibodies may be
CC used in immunoassays and immunodiagnosis. ABZ42523 to ABZ42869 encode
CC GPCR proteins given in ABP81675 to ABP82018, which are used in the
CC exemplification of the present invention
XX

SQ Sequence 425 AA 



Query Match 99.8%; Score 2214; DB 6; Length 425; 

Best Local Similarity 99.5%; Pred. No. 2.2e-220; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
       |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy 421 TTVLP 425
       |||||
Db 421 TTVLP 425





RESULT 9 
ABB56378 

ID ABB56378 standard; protein; 425 AA. 
XX 

AC ABB56378;
XX
DT 18-FEB-2002 (first entry)
XX
DE Non-endogenous human GPCR protein, SEQ ID NO: 549.
XX
KW Human; G protein-coupled receptor; GPCR; non-endogenous; mutant;
KW constitutively activated GPCR; agonist; disease.
XX

OS Homo sapiens. 
OS Synthetic. 
XX 

PN WO200177172-A2. 
XX 

PD 18-OCT-2001. 
XX 

PF 05-APR-2001; 2001WO-US011098.
XX

PR 07-APR-2000; 2000US-0195747P.
XX 

PA (AREN-) ARENA PHARM INC. 
XX 

PI Lehmann-Bruinsma K, Liaw CW, Lin I; 
XX 

DR WPI; 2001-648759/74. 
DR N-PSDB; ABI98014. 
XX 
PT Identifying agonists of G protein-coupled receptors (GPCRs) for use in
PT disease treatment, comprises contacting candidate compounds with versions
PT of GPCRs.
XX 

PS Claim 1; Page 350-351; 394pp; English. 
XX 

CC The invention relates to G protein-coupled receptors (GPCRs) for which 

CC the endogenous ligand has been identified. Non-endogenous constitutively 

CC activated versions of known GPCRs are used in the invention for the 

CC direct identification of candidate compounds as receptor agonists, 

CC inverse agonists or partial agonists. Such agonists are useful as

CC therapeutic agents for diseases or disorders associated with GPCRs. The 

CC present sequence is a non-endogenous version of a known human GPCR 

XX 

SQ Sequence 425 AA; 



Query Match 99.5%; Score 2209; DB 4; Length 425; 

Best Local Similarity 99.3%; Pred. No. 7.4e-220; 

Matches 422; Conservative 1; Mismatches 2; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| |||||||||||||||| |||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTKKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
       |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy 421 TTVLP 425
       |||||
Db 421 TTVLP 425



RESULT 10 
AAE04740 

ID AAE04740 standard; protein; 425 AA. 
XX 

AC AAE04740; 
XX 

DT 10-SEP-2001 (first entry) 
XX 

DE Cynomolgous Monkey Orexin 1 Receptor. 
XX 
KW Cynomolgous monkey; Orexin 1 Receptor; 7 Transmembrane Receptor family;
KW 7TM; gene therapy; vaccine; microbial infection; HIV-1; HIV-2; pain;
KW cancer; diabetes; obesity; anorexia; bulimia; urinary retention;
KW Parkinson's disease; acute heart failure; hypotension; hypertension;
KW osteoporosis; angina pectoris; myocardial infarction; stroke; ulcer;
KW asthma; allergy; benign prostatic hypertrophy; migraine; vomiting;
KW psychotic disorder; neurological disorder; anxiety; schizophrenia;
KW manic depression; depression; delirium; dementia; mental retardation;
KW dyskinesia; Huntington's disease; Gilles de la Tourette's syndrome.
XX

OS Macaca fascicularis.
XX 

PN WO200140259-A2. 
XX 

PD 07-JUN-2001. 
XX 

PF 04-DEC-2000; 2000WO-US032849.
XX

PR 02-DEC-1999; 99US-0168553P.
PR 28-NOV-2000; 2000US-00723781.
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Ellis CE; 



XX 

DR WPI; 2001-408276/43. 

DR N-PSDB; AAD09335. 
XX 

PT Novel Cynomolgous Monkey Orexin 1 Receptor polypeptides, for treating 

PT infections, pain, cancer, diabetes, obesity, asthma, schizophrenia, 

PT hypertension, urinary retention, Parkinson's disease and stroke. 
XX 

PS Claim 2; Page 28; 33pp; English. 
XX 

CC The present sequence is Cynomolgous Monkey Orexin 1 Receptor which is
CC structurally related to members of 7 Transmembrane Receptor (7TM) family.
CC The Orexin 1 Receptor polypeptide and polynucleotide are useful for
CC treating bacterial, fungal, protozoan and viral infections, particularly
CC infections caused by HIV-1 or HIV-2, pain, cancer, diabetes, obesity,
CC anorexia, bulimia, Parkinson's disease, acute heart failure, hypotension,
CC hypertension, urinary retention, osteoporosis, angina pectoris,
CC myocardial infarction, stroke, ulcers, asthma, allergies, benign
CC prostatic hypertrophy, migraine, vomiting, psychotic and neurological
CC disorders including anxiety, schizophrenia, manic depression, depression,
CC delirium, dementia and severe mental retardation, and dyskinesias, such
CC as Huntington's disease or Gilles de la Tourette's syndrome. The
CC polypeptide is also useful for structure-based design of its agonist,
CC antagonist or inhibitor. The polynucleotide is useful for chromosome
CC localisation studies and in gene therapy. The Orexin 1 Receptor
CC polypeptide and polynucleotide are also useful as vaccines
XX 

SQ Sequence 425 AA; 



Query Match 98.4%; Score 2183; DB 4; Length 425;

Best Local Similarity 98.1%; Pred. No. 3.7e-217;

Matches 417; Conservative 4; Mismatches 4; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||| || ||||||||||||||||||||||||||||||||||||||||||:||
Db   1 MEPSATPGAQMRVPTGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFLVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPVSLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||:|||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAVMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||:|||||||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGQPQPRARAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
       ||||||||||||||||||| ||||||||||||||||||||||||||||||:|||||||||
Db 361 LSGKFREQFKAAFSCCLPGPGPCGSLKAPSPRSSASHKSLSLQSRCSVSKLSEHVVLTSV 420

Qy 421 TTVLP 425
       |||||
Db 421 TTVLP 425



RESULT 11
AAW06124

ID AAW06124 standard; protein; 402 AA.
XX
AC AAW06124;
XX
DT 22-FEB-1997 (first entry)
XX
DE Neuropeptide receptor.
XX
KW Human; neuropeptide receptor; drug screening; receptor-agonist;
KW receptor-antagonist; anorectic; antitumour; anticholesterolemic;
KW neuroprotective; anticonvulsant; hypotensive; sedative; diagnostic;
KW gene therapy.
XX
OS Homo sapiens.
XX
FH Key Location/Qualifiers
FT Domain 47..72
FT /note= "Transmembrane region-1"
FT Domain 83..106
FT /note= "Transmembrane region-2"
FT Domain 112..142
FT /note= "Transmembrane region-3"
FT Domain 163..189
FT /note= "Transmembrane region-4"
FT Domain 214..239
FT /note= "Transmembrane region-5"
FT Domain 299..327
FT /note= "Transmembrane region-6"
FT Domain 335..363
FT /note= "Transmembrane region-7"
XX
PN WO9634877-A1.
XX
PD 07-NOV-1996.
XX
PF 05-MAY-1995; 95WO-US005616.
XX
PR 05-MAY-1995; 95WO-US005616.
XX
PA (HUMA-) HUMAN GENOME SCI INC.
XX
PI Soppet DR, Li Y, Rosen CA;
XX
DR WPI; 1996-506094/50.
DR N-PSDB; AAT42826.
XX
PT Human neuro-peptide receptor polypeptide(s) - used to identify
PT antagonists and agonists to such polypeptide(s), e.g. in the treatment of
PT obesity, Alzheimer's disease, epilepsy, etc.
XX
PS Claim 1; Page 49-50; 77pp; English.
XX
CC The sequence represents a human adult hypothalamus neuropeptide receptor,
CC structurally related to the G-protein-coupled receptor family. Splice
CC variants are given in AAW06125-26. The receptor contains 7 transmembrane
CC regions. The receptor may be produced in recombinant form and used in a
CC drug screening assay for isolation of receptor-agonists and -antagonists,
CC which may be used as anorectic, antitumour, anticholesterolemic,
CC neuroprotective, anticonvulsant, hypotensive or sedative drugs, etc. The
CC encoding DNA may be used in genetic disease diagnosis or gene therapy.
CC The receptor itself and its corresponding antibody may also be used in
CC therapy and diagnosis
XX

SQ Sequence 402 AA;

Query Match 94.5%; Score 2097; DB 2; Length 402;

Best Local Similarity 99.5%; Pred. No. 2.8e-208;

Matches 400; Conservative 0; Mismatches 2; Indels 0; Gaps 0;



Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402
       ||||||||||||||||||||||||||||||||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402





RESULT 12 
AAG78346

ID AAG78346 standard; protein; 401 AA.
XX
AC AAG78346;
XX
DT 22-JAN-2002 (first entry)
XX
DE Human HFGAN72X G coupled receptor partial protein sequence.
XX
KW Antibacterial; fungicide; virucide; protozoacide; anti-HIV; analgesic;
KW cytostatic; nootropic; antiparkinsonian; cardiant; antiulcer;
KW antiasthmatic; tranquiliser; neuroleptic; antidepressant; anticonvulsant;
KW osteopathic; HIV infection; pain; cancer; anorexia; bulimia;
KW Parkinson's disease; acute heart failure; hypotension; hypertension;
KW urinary retention; osteoporosis; angina pectoris; myocardial infarction;
KW ulcers; asthma; allergy; delirium; dementia;
KW benign prostatic hypertrophy; anxiety; schizophrenia; manic depression;
KW dyskinesia; G coupled receptor; HFGAN72X; 7 transmembrane receptor.
XX

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers
FT Misc-difference 280
FT /note= "Encoded by GGC"

XX 

PN EP1154019-A2. 
XX 

PD 14-NOV-2001. 
XX 

PF 27-OCT-1997; 2001EP-00203008.
XX

PR 30-APR-1997; 97US-00846704.
PR 27-OCT-1997; 97EP-00308563.
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis CE; 
XX 

DR WPI; 2002-012659/02. 
DR N-PSDB; AAI64173. 
XX 

PT Nucleic acid encoding the HFGAN72X receptor, useful for diagnosis and 
PT treatment of e.g. infections, cancer, anorexia, bulimia, Parkinson's 
PT disease, and acute heart failure. 
XX 

PS Disclosure; Page 10; 24pp; English. 
XX 



CC The present sequence is that of a human HFGAN72X polypeptide encoded by a
CC cDNA shown in AAI64173. The specification describes a newly isolated
CC polynucleotide encoding a HFGAN72X G coupled receptor polypeptide. The
CC protein of the invention has antibacterial, fungicide, virucide,
CC protozoacide, anti-HIV, cardiant, analgesic, cytostatic, nootropic,
CC antiparkinsonian, antiulcer, antiasthmatic, tranquiliser, neuroleptic,
CC antidepressant, anticonvulsant and osteopathic activities. HFGAN72X
CC polynucleotides (PNs) are used to express HFGAN72X in vivo, to treat
CC diseases requiring increased activity or expression of HFGAN72X; for
CC recombinant production of HFGAN72X; diagnose diseases by detecting
CC mutations in genomic sequences and in chromosome identification and
CC mapping. HFGAN72X polypeptides are used to raise specific antibodies; as
CC therapeutic agents; to identify HFGAN72X protein-expressing clones; to
CC purify HFGAN72X proteins; in vaccines. Cells transformed with HFGAN72X
CC PNs are used to identify (ant)agonists of HFGAN72X, useful
CC therapeutically. Nucleic acids that inhibit expression of HFGAN72X and
CC polypeptides that compete with ligands for binding to HFGAN72X proteins
CC are also useful therapeutically and diagnostically. HFGAN72X-related
CC diseases include infections (bacterial, viral, fungal or protozoal,
CC particularly HIV-1 or -2); pain; cancer; anorexia; bulimia; Parkinson's
CC disease; acute heart failure; hypotension; hypertension; urinary
CC retention; osteoporosis; angina pectoris; myocardial infarction; ulcers;
CC asthma; allergy; benign prostatic hypertrophy; anxiety; schizophrenia;
CC manic depression; delirium; dementia; severe mental retardation and
CC dyskinesias

XX 

SQ Sequence 401 AA; 



Query Match 94.3%; Score 2093.5; DB 5; Length 401; 

Best Local Similarity 99.8%; Pred. No. 6.5e-208; 

Matches 401; Conservative 0; Mismatches 0; Indels 1; Gaps 1;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPR-RAFLAEVKQMRARRKTAKML 299

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 300 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 359

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402
       ||||||||||||||||||||||||||||||||||||||||||
Db 360 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 401





RESULT 13 
AAB47300 

ID AAB47300 standard; protein; 427 AA. 
XX 

AC AAB47300; 
XX 

DT 22-AUG-2001 (first entry) 
XX 

DE Dog orexin 1 receptor. 
XX 



KW Beagle; dog; orexin 1 receptor; human immunodeficiency virus; HIV; pain;
KW cancer; diabetes; anorexia; bulimia; asthma; angina pectoris;
KW Parkinson's disease; acute heart failure; hypotension; schizophrenia;
KW hypertension; urinary retention; osteoporosis; Huntington's disease;
KW myocardial infarction; stroke; ulcer; allergy; depression; delirium;
KW benign prostatic hypertrophy; migraine; vomiting; psychosis; anxiety;
KW dementia; severe mental retardation; dyskinesia.
XX 

OS Canis familiaris. 
XX 

PN WO200142268-A1. 
XX 

PD 14-JUN-2001. 
XX 

PF 07-DEC-2000; 2000WO-US033106.
XX

PR 07-DEC-1999; 99US-0169373P.
PR 06-DEC-2000; 2000US-00169373.
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Ellis CE; 
XX 

DR WPI; 2001-381627/40. 

DR N-PSDB; AAC85974. 
XX 

PT Novel Beagle Orexin 1 receptor polypeptides and polynucleotides for 

PT identifying modulators for use in preventing, and treating psychotic and 

PT neurological disorders, asthma, cancer, diabetes and microbial 

PT infections. 

XX 

PS Claim 1; Page 33; 36pp; English. 
XX 

CC This sequence represents Beagle Orexin 1 receptor. The orexin 1 

CC polypeptide is useful in identifying compounds that may be agonists 

CC and/or antagonists. The compounds identified are useful in preventing and 

CC treating human diseases, including bacterial, fungal, protozoan and viral 

CC infections, particularly, infections caused by human immunodeficiency

CC virus (HIV), pain, cancer, diabetes, anorexia, bulimia, asthma, 

CC Parkinson's diseases, acute heart failure, hypotension, hypertension, 

CC urinary retention, osteoporosis, angina pectoris, myocardial infarction, 

CC stroke, ulcer, allergies, benign prostatic hypertrophy, migraine, 

CC vomiting, psychotic and neurological disorders, including anxiety, 

CC schizophrenia, depression, delirium, dementia and severe mental 

CC retardation and dyskinesias, such as Huntington's disease 
XX 

SQ Sequence 427 AA; 



Query Match 94.1%; Score 2087; DB 4; Length 427; 

Best Local Similarity 94.1%; Pred. No. 3.3e-207; 

Matches 402; Conservative 5; Mismatches 18; Indels 2; Gaps 1; 
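
The Best Local Similarity figures printed in these summaries are consistent with the number of matches divided by the total number of aligned columns (matches + conservative + mismatches + indels). That relationship is inferred from the numbers in this listing, not taken from the search tool's documentation, so the sketch below is only a sanity check. The function name `best_local_similarity` is hypothetical.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of aligned columns that are exact matches, to one decimal.

    Assumption (inferred from this listing, not documented): the aligned
    length is the sum of all four column counts.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Column counts copied from the RESULT 13 summary.
print(best_local_similarity(402, 5, 18, 2))
```

The same relationship reproduces the values reported for the other hits in this listing, for example 400 matches and 2 mismatches for the 402-residue entry.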

Qy   1 MEPSATPGAQMGVPPGSRE--PSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFV 58
       |||||||||| | | |  |  || |||||||||||||||||||||||||||||||||||:
Db   1 MEPSATPGAQTGTPTGGGELSPSLVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFL 60

Qy  59 VALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHAL 118
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||| |
Db  61 VALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHTL 120

Qy 119 CKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVP 178
       |||||||||||||||||||||||||||||||||||||||||||| |||||||||||:|||
Db 121 CKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARSSILGIWAVSLAVMVP 180

Qy 179 QAAVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQI 238
       |||||||||||||||||||||||||| |||||||||||||||||||||||||||||||||
Db 181 QAAVMECSSVLPELANRTRLFSVCDEHWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQI 240

Qy 239 FRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAK 298
       ||||||||||||||||||||||||||| |   ||| || |||||||||||||||||||||
Db 241 FRKLWGRQIPGTTSALVRNWKRPSDQLEDQGPGLSAEPPPRARAFLAEVKQMRARRKTAK 300

Qy 299 MLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIY 358
       ||||:||||||||||||||||||||||||||:||||||||||||||||||||||||||||
Db 301 MLMVLLLVFALCYLPISVLNVLKRVFGMFRQSSDREAVYACFTFSHWLVYANSAANPIIY 360

Qy 359 NFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLT 418
       ||||||||||||||||||||||||||| |||||||||||||||| |||||||: ||||||
Db 361 NFLSGKFREQFKAAFSCCLPGLGPCGSPKAPSPRSSASHKSLSLHSRCSVSKVPEHVVLT 420

Qy 419 SVTTVLP 425
       |||||||
Db 421 SVTTVLP 427





RESULT 14 
AAU00442 

ID AAU00442 standard; protein; 364 AA. 
XX 

AC AAU00442; 
XX 

DT 18-JUN-2001 (first entry) 
XX 

DE Human neuropeptide receptor N-terminal and C-terminal deletion mutant. 
XX 
KW Human; neuropeptide receptor; neuropeptide Y receptor; obesity;
KW nervous system disorder; hyperproliferative disorder; diabetes mellitus;
KW cardiovascular disorder; autoimmune disorder; infectious disorder;
KW eating behaviour disorder; narcolepsy; neurological disease;
KW narcotics addiction; nicotine addiction; alcohol addiction; gene therapy;
KW protein co-ordinate data; mutant; mutein.
XX

OS Homo sapiens. 
OS Synthetic. 
XX 

FH Key Location/Qualifiers
FT Protein 1..364
FT /note= "Corresponds to wild type neuropeptide receptor
FT residues 17-380"

XX 

PN WO200117532-A1. 
XX 

PD 15-MAR-2001.
XX
PF 07-SEP-2000; 2000WO-US024518.
XX
PR 10-SEP-1999; 99US-00393696.
XX
PA (HUMA-) HUMAN GENOME SCI INC.
XX
PI Soppet DR, Li Y, Rosen CA;
XX
DR WPI; 2001-183276/18.
XX
PT A new nucleic acid encoding a human neuropeptide receptor polypeptide,
PT useful for preventing, treating or ameliorating obesity, narcolepsy,
PT neurological disease and addiction to narcotics, nicotine and alcohol.
XX

PS Example 9; Page; 385pp; English. 
XX 

CC The present sequence represents a human neuropeptide receptor mutant
CC protein which is constructed by the deletion of N-terminal residues 1-16
CC and C-terminal residues 381-425 of the wild type novel neuropeptide
CC receptor protein. The novel neuropeptide receptor (AAU00438) shows
CC sequence homology to the neuropeptide Y receptor. Two splice variants of
CC the neuropeptide receptor (AAU00439-AAU00440) and a possible.
CC Polypeptides and polynucleotides of the neuropeptide receptor are useful
CC for diagnosing, preventing, or treating a pathological condition in a
CC subject related to the central nervous and peripheral nervous systems
CC (CNS and PNS). The polypeptides and polynucleotides may be used to treat
CC hyperproliferative, cardiovascular, autoimmune, nervous system or
CC infectious disorders e.g. cancer, heart disease, rheumatoid arthritis,
CC Alzheimer's disease, HIV infection and diabetes mellitus. In particular
CC they are useful for preventing, treating or ameliorating a medical
CC condition in a mammal such as obesity/eating behaviour disorders,
CC narcolepsy, neurological disease, addiction to narcotics, nicotine and
CC alcohol, chronic pain, acute pain, migraine headaches and anxiety
CC disorders. The polynucleotides encoding the neuropeptide receptor can
CC also be used in gene therapy methods for treating such diseases. Note the
CC present sequence is not given in the patent but is indexed from the wild
CC type sequence shown in Figure 4 (AAU00438)
XX 

SQ Sequence 364 AA; 



Query Match 86.0%; Score 1908; DB 4; Length 364; 

Best Local Similarity 100.0%; Pred. No. 9.7e-189; 

Matches 364; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy  17 SREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHH 76
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 SREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHH 60

Qy  77 MRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLT 136
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 MRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLT 120

Qy 137 LSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRT 196
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 LSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRT 180

Qy 197 RLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVR 256
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 RLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVR 240

Qy 257 NWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISV 316
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 NWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISV 300

Qy 317 LNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCC 376
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 LNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCC 360

Qy 377 LPGL 380
       ||||
Db 361 LPGL 364
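
The coordinates in the RESULT 14 record can be cross-checked with a little index arithmetic: the entry describes a 364-residue mutant corresponding to wild-type residues 17-380, and the alignment pairs Qy 17-380 with Db 1-364. The sketch below is illustrative only; the helper names are hypothetical, and the wild-type string is a placeholder of the right length (425 residues), not the real sequence.

```python
def deletion_mutant(wild_type, first, last):
    """Return residues first..last (1-based, inclusive) of wild_type."""
    return wild_type[first - 1:last]


def mutant_to_wild_type(position, first=17):
    """Map a 1-based mutant (Db) position to its wild-type (Qy) position."""
    return position + first - 1


# Placeholder 425-residue wild type; only the length matters for this check.
wild_type = "X" * 425
mutant = deletion_mutant(wild_type, 17, 380)

print(len(mutant))                # length of the deletion mutant
print(mutant_to_wild_type(1))     # wild-type position of mutant residue 1
print(mutant_to_wild_type(364))   # wild-type position of the last mutant residue
```

With these inputs the mutant length is 364, and mutant positions 1 and 364 map to wild-type positions 17 and 380, which agrees with the FT note and the Qy/Db numbering of the alignment.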



RESULT 15 
AAW80805 

ID AAW80805 standard; protein; 389 AA. 
XX 

AC AAW80805; 
XX 

DT 29-JAN-1999 (first entry) 
XX 
DE Amino acid sequence of HFGAN72Y a G-protein coupled receptor.
XX
KW G-protein coupled receptor family; HFGAN72Y; mutation; probe; agonist;
KW antagonist; activation; inhibition; gene therapy; antibody;
KW immune response; vaccine; HIV-1; HIV-2; cancer; anorexia; bulimia;
KW asthma; Parkinson's disease; acute heart failure; hypotension;
KW hypertension; urinary retention; osteoporosis; angina pectoris;
KW myocardial infarction; ulcer; allergies; psychotic disorder;
KW neurological disorder; gene mapping.
XX

OS Homo sapiens. 
XX 

PN EP875565-A2. 
XX 

PD 04-NOV-1998. 
XX 

PF 27-OCT-1997; 97EP-00308554.
XX

PR 30-APR-1997; 97US-00846705.
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis C; 
XX 

DR WPI; 1998-570286/49. 
DR N-PSDB; AAV68511. 
XX 

PT New G-protein coupled receptor HFGAN72Y polypeptide and polynucleotide - 
PT useful as diagnostic reagents and for prevention and treatment of HIV 
PT infections, cancer, osteoporosis and Parkinson's disease 
XX 

PS Claim 1; Page 7; 22pp; English. 



XX 

CC This is the amino acid sequence of the G-protein coupled receptor, 

CC HFGAN72Y used in the method of the invention. HFGAN72Y polypeptides and 

CC polynucleotides are useful for diagnosing susceptibility to diseases by 

CC detecting mutations in the HFGAN72Y gene using probes containing the 

CC HFGAN72Y nucleotide sequence, and can diagnose diseases associated with 

CC HFGAN72Y imbalance by determining HFGAN72Y polypeptide or mRNA expression 

CC levels. Agonists/antagonists can be used in treatment to activate/inhibit 

CC HFGAN72Y activity, in addition to direct administration of antisense 

CC sequences to prevent expression, or HFGAN72Y polypeptides to treat 

CC conditions associated with a lack of HFGAN72Y protein. Gene therapy may also

CC be used to affect endogenous HFGAN72Y polypeptide production. HFGAN72Y 

CC antibodies are useful for inducing an immune response to immunise and 

CC prevent diseases, and for isolating HFGAN72Y clones or purifying the 

CC polypeptides by affinity chromatography. HFGAN72Y polypeptides can be 

CC administered directly or as a vaccine to inoculate against diseases. 

CC Diseases diagnosed, prevented or treated include HIV-1 or HIV-2 

CC infections, pain, cancers, anorexia, bulimia, asthma, Parkinson's 

CC disease, acute heart failure, hypotension, hypertension, urinary 

CC retention, osteoporosis, angina pectoris, myocardial infarction, ulcers,

CC allergies, benign prostatic hypertrophy, and psychotic and neurological 

CC disorders. The HFGAN72Y polypeptide is also useful for mapping the gene 

CC to a chromosome, allowing gene inheritance to be studied through linkage 

CC analysis 

XX 

SQ Sequence 389 AA; 

Query Match 85.7%; Score 1902.5; DB 2; Length 389; 

Best Local Similarity 96.1%; Pred. No. 4e-188; 

Matches 368; Conservative 2; Mismatches 10; Indels 3; Gaps 2;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSC-CLPGLGP 382
       |||   ::   | ||   ||  |
Db 361 LSG--CKEKSLALSCPSCPGHDP 381



Search completed: September 28, 2004, 09:46:56 
Job time : 130 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on:         September 28, 2004, 09:44:56 ; Search time 34 Seconds
                (without alignments)
                645.325 Million cell updates/sec

Title:
Perfect score:
Sequence:

Scoring table:

Searched:       389414 seqs, 51625971 residues

Total number of hits satisfying chosen parameters:

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
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q 9 i i n /i 
y / x— x u4 


Sequence 


104, App 


9 9 
3Z 


yl 9 A 
4 94 


x y . 


cz 
b 


9 9 n 

9 / U 


A 

4 


T T O 

U o~ 


uy- 


/ y y- 


Q R R R 

y33-3 


Sequence 


5, 


Appli 


9 "3 


/! 9 /I 

4 94 


1 Q 

x y . 


cz 
b 


9 9 n 

9 / U 


4 


U 0- 


u y- 


4 bX- 


430B-Z b 


Sequence 


26 


, Appl 


9 /l 
3 4 


/l 9 A 
4 9 4 


1 Q 

x y . 


b 


9 Q 1 
9 O X 


X 


TT C 

u 0— 


u 0 - 


x yz — 


O Q Q O 

z! 0 0 — z 


Sequence 


2, 


Appli 


o c 
oo 


/l O A 

4 94 


ly . 


r 
b 


O O T 

381 


z 


us- 


08- 


/TOT 

687- 


O C C 7\ O 

355A-2 


Sequence 


2, 


Appli 


9 CZ. 
3 O 


A 9 /I 
4 9 4 


1 Q 

x y . 


b 


9 Q 1 
9 O X 


4 


TTc 

U 0 — 


u y — 


9 n n 
z U (J — 


cz 1 0 i c: 
b / 3 — X b 


Sequence 


16 


, Appl 


9 9 
3 / 


/I 9 /I 
4 9 4 


1 Q 

x y . 


cz 
b 


001 
9 0 X 


A 

4 


us - 


uy- 


/i n 7 
4 U / — 


z> iZn 0 
3D / — Z 


Sequence 


2, 


Appli 


9 Q 
3 O 


/ 9 9 R 
43Z . 3 


x y . 


3 


9 9 n 
3 / U 


9 


Uo- 


u y- 


X /z- 


O C O "7 
333- / 


Sequence 


1, 


Appli 


9 Q 

3 y 


A 9 9 R 
43z . D 


1 Q 

x y . 


3 


97 n 

9 / U 


4 


T T C? 

Uo 


n q 
U y — 


7 Q Q 

/ y y — 


Q R R 7 

y33— / 


Sequence 


1, 


Appli 


a o 

4 u 


/19Q R 
4 Z y . 3 


1 Q 

x y . 


A 
4 


9 ft 1 
9 O X 


Z 


TT C5 — 
UO 


0 ft — 
Uo 


£ft 7 — 

bo/ 


Q R R 7\ 

3 3 3/\ b 


Sequence 


6, 


Appli 


    41   429.5   19.4    381  4  US-09-407-367-6     Sequence 6, Appli
    42   428.5   19.3    428  1  US-08-570-157-5     Sequence 5, Appli
    43   428.5   19.3    428  3  US-08-029-170-31    Sequence 31, Appl
    44   428.5   19.3    428  4  US-09-076-510-5     Sequence 5, Appli
    45   428.5   19.3    428  4  US-09-004-349-5     Sequence 5, Appli



ALIGNMENTS 



RESULT 1 
US-09-479-128-2 

; Sequence 2, Application US/09479128
; Patent No. 6319710
; GENERAL INFORMATION:
; APPLICANT: Berglind Ran Olafsdottir
; APPLICANT: Jeffrey Gulcher
; TITLE OF INVENTION: HUMAN NARCOLEPSY GENE
; FILE REFERENCE: 2345.1005-001
; CURRENT APPLICATION NUMBER: US/09/479,128
; CURRENT FILING DATE: 2000-01-07
; PRIOR APPLICATION NUMBER: US 09/379,083
; PRIOR FILING DATE: 1999-08-23
; NUMBER OF SEQ ID NOS: 22
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 2
; LENGTH: 425
; TYPE: PRT
; ORGANISM: Homo Sapiens
US-09-479-128-2 



Query Match 100.0%; Score 2218; DB 4; Length 425; 

Best Local Similarity 99.8%; Pred. No. 7.7e-193; 

Matches 424; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
       |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy 421 TTVLP 425
       |||||
Db 421 TTVLP 425



RESULT 2 
US-08-846-704-2 

; Sequence 2, Application US/08846704
; Patent No. 6020157
; GENERAL INFORMATION:
; APPLICANT: BERGSMA, DERK J.
; APPLICANT: ELLIS, CATHERINE E.
; TITLE OF INVENTION: NOVEL G-PROTEIN COUPLED
; NUMBER OF SEQUENCES: 4
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: RATNER & PRESTIA
; STREET: P.O. BOX 980
; CITY: VALLEY FORGE
; STATE: PA
; COUNTRY: USA
; ZIP: 19482
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Diskette
; COMPUTER: IBM Compatible
; OPERATING SYSTEM: DOS
; SOFTWARE: FastSEQ for Windows Version 2.0
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/846,704
; FILING DATE: 30-APR-1997
; CLASSIFICATION: 435
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER:
; FILING DATE:
; ATTORNEY/AGENT INFORMATION:
; NAME: PRESTIA, PAUL F
; REGISTRATION NUMBER: 23,031
; REFERENCE/DOCKET NUMBER: GH-70002
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 610-407-0700
; TELEFAX: 610-407-0701
; TELEX: 846169
; INFORMATION FOR SEQ ID NO: 2:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 425 amino acids
; TYPE: amino acid
; STRANDEDNESS: single
; TOPOLOGY: linear
; MOLECULE TYPE: protein
US-08-846-704-2 

Query Match 99.8%; Score 2214; DB 3; Length 425; 

Best Local Similarity 99.5%; Pred. No. 1.8e-192; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
       |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy 421 TTVLP 425
       |||||
Db 421 TTVLP 425



RESULT 3 

US-09-211-823C-22 

; Sequence 22, Application US/09211823C
; Patent No. 6664229
; GENERAL INFORMATION:
; APPLICANT: HAGEN, JAMES JOSEPH
; APPLICANT: TERRETT, JONATHAN ALEXANDER
; APPLICANT: UPTON, NEIL
; APPLICANT: PIPER, DAVID
; APPLICANT: SMITH, MARTIN IAN
; APPLICANT: KENNETT, GUY ANTHONY
; APPLICANT: PATEL, SARASWATI R.
; TITLE OF INVENTION: METHODS OF TREATMENT USING NOVEL LIGANDS
; TITLE OF INVENTION: OF THE NEUROPEPTIDE RECEPTOR HFGAN72 AND AGONISTS OR
; TITLE OF INVENTION: ANTAGONISTS THEREOF
; FILE REFERENCE: P50745
; CURRENT APPLICATION NUMBER: US/09/211,823C
; CURRENT FILING DATE: 1998-12-15
; PRIOR APPLICATION NUMBER: US 60/069,459
; PRIOR FILING DATE: 1997-12-15
; PRIOR APPLICATION NUMBER: US 60/069,785
; PRIOR FILING DATE: 1997-12-16
; NUMBER OF SEQ ID NOS: 23
; SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 22
; LENGTH: 425
; TYPE: PRT
; ORGANISM: HOMO SAPIENS
US-09-211-823C-22 

Query Match 99.8%; Score 2214; DB 4; Length 425; 

Best Local Similarity 99.5%; Pred. No. 1.8e-192; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
       |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy 421 TTVLP 425
       |||||
Db 421 TTVLP 425



RESULT 4 
US-08-846-704-4 

; Sequence 4, Application US/08846704
; Patent No. 6020157
; GENERAL INFORMATION:
; APPLICANT: BERGSMA, DERK J.
; APPLICANT: ELLIS, CATHERINE E.
; TITLE OF INVENTION: NOVEL G-PROTEIN COUPLED
; NUMBER OF SEQUENCES: 4
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: RATNER & PRESTIA
; STREET: P.O. BOX 980
; CITY: VALLEY FORGE
; STATE: PA
; COUNTRY: USA
; ZIP: 19482
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Diskette
; COMPUTER: IBM Compatible
; OPERATING SYSTEM: DOS
; SOFTWARE: FastSEQ for Windows Version 2.0
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/846,704
; FILING DATE: 30-APR-1997
; CLASSIFICATION: 435
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER:
; FILING DATE:
; ATTORNEY/AGENT INFORMATION:
; NAME: PRESTIA, PAUL F
; REGISTRATION NUMBER: 23,031
; REFERENCE/DOCKET NUMBER: GH-70002
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 610-407-0700
; TELEFAX: 610-407-0701
; TELEX: 846169
; INFORMATION FOR SEQ ID NO: 4:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 402 amino acids
; TYPE: amino acid
; STRANDEDNESS: single
; TOPOLOGY: linear
; MOLECULE TYPE: protein
US-08-846-704-4 

Query Match 95.0%; Score 2108; DB 3; Length 402; 

Best Local Similarity 100.0%; Pred. No. 6.6e-183; 

Matches 402; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402
       ||||||||||||||||||||||||||||||||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402



RESULT 5 

US-08-462-509B-2 

; Sequence 2, Application US/08462509B
; Patent No. 6410701
; GENERAL INFORMATION:
; APPLICANT: Soppet, Daniel et al
; TITLE OF INVENTION: Human Neuropeptide Receptor
; NUMBER OF SEQUENCES: 12
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: Human Genome Sciences, Inc.
; STREET: 9410 Key West Avenue
; CITY: Rockiville
; STATE: MD
; COUNTRY: USA
; ZIP: 20850
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.30
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/462,509B
; FILING DATE: 05-JUN-1995
; CLASSIFICATION: 435
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER: WO PCT/US95/05616
; FILING DATE: 05-MAY-1995
; ATTORNEY/AGENT INFORMATION:
; NAME: Wales, Michele M.
; REGISTRATION NUMBER: 43,975
; REFERENCE/DOCKET NUMBER: PF168P1
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 301-309-8504
; TELEFAX: 301-309-8439
; INFORMATION FOR SEQ ID NO: 2:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 402 amino acids
; TYPE: amino acid
; TOPOLOGY: linear
; MOLECULE TYPE: protein
US-08-462-509B-2 

Query Match 94.8%; Score 2104; DB 4; Length 402; 

Best Local Similarity 99.8%; Pred. No. 1.5e-182; 

Matches 401; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402
       ||||||||||||||||||||||||||||||||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402



RESULT 6 

PCT-US95-05616-2 

; Sequence 2, Application PC/TUS9505616
; GENERAL INFORMATION:
; APPLICANT: LI, ET AL.
; TITLE OF INVENTION: Human Neuropeptide Receptor
; NUMBER OF SEQUENCES: 12
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN,
; ADDRESSEE: CECCHI, STEWART & OLSTEIN
; STREET: 6 BECKER FARM ROAD
; CITY: ROSELAND
; STATE: NEW JERSEY
; COUNTRY: USA
; ZIP: 07068
; COMPUTER READABLE FORM:
; MEDIUM TYPE: 3.5 INCH DISKETTE
; COMPUTER: IBM PS/2
; OPERATING SYSTEM: MS-DOS
; SOFTWARE: WORD PERFECT 5.1
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: PCT/US95/05616
; FILING DATE: concurrently
; CLASSIFICATION:
; ATTORNEY/AGENT INFORMATION:
; NAME: FERRARO, GREGORY D.
; REGISTRATION NUMBER: 36,134
; REFERENCE/DOCKET NUMBER: 325800-268
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 201-994-1700
; TELEFAX: 201-994-1744
; INFORMATION FOR SEQ ID NO: 2:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 402 AMINO ACIDS
; TYPE: AMINO ACID
; STRANDEDNESS:
; TOPOLOGY: LINEAR
; MOLECULE TYPE: PROTEIN
PCT-US95-05616-2 



Query Match 94.8%; Score 2104; DB 5; Length 402; 

Best Local Similarity 99.8%; Pred. No. 1.5e-182; 

Matches 401; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402
       ||||||||||||||||||||||||||||||||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402



RESULT 7 
US-08-846-705-2 

; Sequence 2, Application US/08846705
; Patent No. 5935814
; GENERAL INFORMATION:
; APPLICANT: BERGSMA, DERK J.
; APPLICANT: ELLIS, CATHERINE E.
; TITLE OF INVENTION: NOVEL G-PROTEIN COUPLED
; NUMBER OF SEQUENCES: 5
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: RATNER & PRESTIA
; STREET: P.O. BOX 980
; CITY: VALLEY FORGE
; STATE: PA
; COUNTRY: USA
; ZIP: 19482
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Diskette
; COMPUTER: IBM Compatible
; OPERATING SYSTEM: DOS
; SOFTWARE: FastSEQ for Windows Version 2.0
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/846,705
; FILING DATE: 30-APR-1997
; CLASSIFICATION: 435
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER:
; FILING DATE:
; ATTORNEY/AGENT INFORMATION:
; NAME: PRESTIA, PAUL F
; REGISTRATION NUMBER: 23,031
; REFERENCE/DOCKET NUMBER: GH-70003
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 610-407-0700
; TELEFAX: 610-407-0701
; TELEX: 846169
; INFORMATION FOR SEQ ID NO: 2:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 389 amino acids
; TYPE: amino acid
; STRANDEDNESS: single
; TOPOLOGY: linear
; MOLECULE TYPE: protein
US-08-846-705-2 



Query Match 85.7%; Score 1902.5; DB 2; Length 389; 

Best Local Similarity 96.1%; Pred. No. 2.6e-164; 

Matches 368; Conservative 2; Mismatches 10; Indels 3; Gaps 2;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSC-CLPGLGP 382
       |||   ::   | ||   ||  |
Db 361 LSG--CKEKSLALSCPSCPGHDP 381



RESULT 8 

US-09-211-823C-23 

; Sequence 23, Application US/09211823C
; Patent No. 6664229
; GENERAL INFORMATION:
; APPLICANT: HAGEN, JAMES JOSEPH
; APPLICANT: TERRETT, JONATHAN ALEXANDER
; APPLICANT: UPTON, NEIL
; APPLICANT: PIPER, DAVID
; APPLICANT: SMITH, MARTIN IAN
; APPLICANT: KENNETT, GUY ANTHONY
; APPLICANT: PATEL, SARASWATI R.
; TITLE OF INVENTION: METHODS OF TREATMENT USING NOVEL LIGANDS
; TITLE OF INVENTION: OF THE NEUROPEPTIDE RECEPTOR HFGAN72 AND AGONISTS OR
; TITLE OF INVENTION: ANTAGONISTS THEREOF
; FILE REFERENCE: P50745
; CURRENT APPLICATION NUMBER: US/09/211,823C
; CURRENT FILING DATE: 1998-12-15
; PRIOR APPLICATION NUMBER: US 60/069,459
; PRIOR FILING DATE: 1997-12-15
; PRIOR APPLICATION NUMBER: US 60/069,785
; PRIOR FILING DATE: 1997-12-16
; NUMBER OF SEQ ID NOS: 23
; SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 23
; LENGTH: 389
; TYPE: PRT
; ORGANISM: HOMO SAPIENS
US-09-211-823C-23 

Query Match 85.7%; Score 1902.5; DB 4; Length 389; 

Best Local Similarity 96.1%; Pred. No. 2.6e-164; 

Matches 368; Conservative 2; Mismatches 10; Indels 3; Gaps 2;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSC-CLPGLGP 382
       |||   ::   | ||   ||  |
Db 361 LSG--CKEKSLALSCPSCPGHDP 381



RESULT 9 

PCT-US95-05616-6 

; Sequence 6, Application PC/TUS9505616
; GENERAL INFORMATION:
; APPLICANT: LI, ET AL.
; TITLE OF INVENTION: Human Neuropeptide Receptor
; NUMBER OF SEQUENCES: 12
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN,
; ADDRESSEE: CECCHI, STEWART & OLSTEIN
; STREET: 6 BECKER FARM ROAD
; CITY: ROSELAND
; STATE: NEW JERSEY
; COUNTRY: USA
; ZIP: 07068
; COMPUTER READABLE FORM:
; MEDIUM TYPE: 3.5 INCH DISKETTE
; COMPUTER: IBM PS/2
; OPERATING SYSTEM: MS-DOS
; SOFTWARE: WORD PERFECT 5.1
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: PCT/US95/05616
; FILING DATE: concurrently
; CLASSIFICATION:
; ATTORNEY/AGENT INFORMATION:
; NAME: FERRARO, GREGORY D.
; REGISTRATION NUMBER: 36,134
; REFERENCE/DOCKET NUMBER: 325800-268
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 201-994-1700
; TELEFAX: 201-994-1744
; INFORMATION FOR SEQ ID NO: 6:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 377 BASE PAIRS
; TYPE: AMINO ACID
; STRANDEDNESS: SINGLE
; TOPOLOGY: LINEAR
; MOLECULE TYPE: cDNA
PCT-US95-05616-6 

Query Match 85.7%; Score 1901; DB 5; Length 377; 

Best Local Similarity 96.6%; Pred. No. 3.4e-164; 

Matches 366; Conservative 2; Mismatches 9; Indels 2; Gaps 1 
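
The Query Match percentages in these result blocks are consistent with the hit score divided by the perfect score of the run (the value 2219 printed with the run header for this query). A minimal Python sketch, assuming that relationship and rounding to one decimal; the function name is illustrative and is not part of GenCore:

```python
def query_match_percent(score, perfect_score):
    """Hit score as a percentage of the query's perfect (self-match) score.

    Assumption: Query Match = 100 * Score / Perfect score, rounded to one
    decimal place. This is inferred from the figures in the report, not
    taken from GenCore documentation.
    """
    return round(100.0 * score / perfect_score, 1)


# Score 1901 against a perfect score of 2219
print(query_match_percent(1901, 2219))
```

Under this assumption the scores 1901, 1897 and 1460 reproduce the reported 85.7%, 85.5% and 65.8%.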

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I II I I I 1 I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I t I I II I I I I I I I I I II I I I I I I I I I I II I I 
Db 61 LVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I I I I I I I 

Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300

Db 241 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPG 379

Db 361 LSGCKEKSLVLSPSC--PG 377



RESULT 10 
US-08-462-509B-4 

; Sequence 4, Application US/08462509B 

; Patent No. 6410701 

; GENERAL INFORMATION: 

; APPLICANT: Soppet, Daniel et al 

; TITLE OF INVENTION: Human Neuropeptide Receptor 
; NUMBER OF SEQUENCES: 12
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 

CITY: Rockville

STATE: MD 

COUNTRY: USA 

ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/08/462,509B

; FILING DATE: 05-JUN-1995 

; CLASSIFICATION: 435 

; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: WO PCT/US95/05616

; FILING DATE: 05-MAY-1995 

; ATTORNEY/AGENT INFORMATION: 

; NAME: Wales, Michele M. 

; REGISTRATION NUMBER: 43,975 

; REFERENCE/DOCKET NUMBER: PF168P1

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 301-309-8504 
; TELEFAX: 301-309-8439 

; INFORMATION FOR SEQ ID NO: 4: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 369 amino acids 

; TYPE: amino acid 

; TOPOLOGY: linear 

; MOLECULE TYPE: protein 
US-08-462-509B-4 

Query Match 85.5%; Score 1897; DB 4; Length 369; 

Best Local Similarity 99.7%; Pred. No. 7.6e-164; 

Matches 362; Conservative 0; Mismatches 1; Indels 0; Gaps 0 
Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Db

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I I I I I I I I II I I I I I I I I
Db 61 LVGNTLVCLAWRNHHMRTWNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I 
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I I II I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSG 363 

I I I 

Db 361 LSG 363 



RESULT 11 
PCT-US95-05616-4 

; Sequence 4, Application PC/TUS9505616 
; GENERAL INFORMATION: 

APPLICANT: LI, ET AL. 
; TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN,

ADDRESSEE: CECCHI, STEWART & OLSTEIN 

STREET: 6 BECKER FARM ROAD 

CITY: ROSELAND 

STATE: NEW JERSEY 
; COUNTRY: USA 

ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 

COMPUTER: IBM PS/2 

OPERATING SYSTEM: MS-DOS 

SOFTWARE: WORD PERFECT 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US95/05616 

FILING DATE: concurrently 

CLASSIFICATION: 
; ATTORNEY/AGENT INFORMATION: 

NAME: FERRARO, GREGORY D. 
; REGISTRATION NUMBER: 36,134 

REFERENCE/DOCKET NUMBER: 325800-268 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 201-994-1700 

TELEFAX: 201-994-1744 
; INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 369 BASE PAIRS 

TYPE: AMINO ACID 
; STRANDEDNESS: SINGLE 

TOPOLOGY: LINEAR 



; MOLECULE TYPE: cDNA 
PCT-US95-05616-4 



Query Match 85.5%; Score 1897; DB 5; Length 369; 

Best Local Similarity 99.7%; Pred. No. 7.6e-164; 

Matches 362; Conservative 0; Mismatches 1; Indels 0; Gaps 0 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVEWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 61 LVGNTLVCLAWRNHHMRTWNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 

Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300

I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360 

Qy 361 LSG 363 

I I I 

Db 361 LSG 363 



RESULT 12 
US-08-462-509B-6 

; Sequence 6, Application US/08462509B 

; Patent No. 6410701 

; GENERAL INFORMATION: 

; APPLICANT: Soppet, Daniel et al 

TITLE OF INVENTION: Human Neuropeptide Receptor 

NUMBER OF SEQUENCES: 12 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 
; CITY: Rockville

STATE: MD 
; COUNTRY: USA 

; ZIP: 20850 

; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA: 



APPLICATION NUMBER: US/08/462,509B
FILING DATE: 05-JUN-1995 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/US95/05616
FILING DATE: 05-MAY-1995 
ATTORNEY/AGENT INFORMATION: 
NAME: Wales, Michele M. 
REGISTRATION NUMBER: 43,975 
REFERENCE/DOCKET NUMBER: PF168P1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 301-309-8504
TELEFAX: 301-309-8439 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 372 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-462-509B-6 

Query Match 85.4%; Score 1894; DB 4; Length 372; 

Best Local Similarity 99.4%; Pred. No. 1.4e-163; 

Matches 361; Conservative 1; Mismatches 1; Indels 0; Gaps 0 

Qy 1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

I I I I 1 I I I II I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEPSATPGAQMGVPPGSRDPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWA 60 

Qy 61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

I I I I M I I I M I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I 

Db 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I II I I I I M I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I 
Db 181 AVMECSSVLPEIANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300 

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I 
Db 301 MWLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSG 363 

I I I 

Db 361 LSG 363 



RESULT 13 
US-09-426-290-2 

; Sequence 2, Application US/09426290 



Patent No. 6410712 
GENERAL INFORMATION: 
APPLICANT: Berglind Ran Olafsdottir 
APPLICANT: Jeffrey Gulcher 

TITLE OF INVENTION: HUMAN NARCOLEPSY GENE 
FILE REFERENCE: 2345.2001-000 
CURRENT APPLICATION NUMBER: US/09/426,290
CURRENT FILING DATE: 1999-10-25 
NUMBER OF SEQ ID NOS: 24

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 2 
LENGTH: 444 
TYPE: PRT 

ORGANISM: Homo Sapiens 
US-09-426-290-2 

Query Match 65.8%; Score 1460; DB 4; Length 444; 

Best Local Similarity 68.8%; Pred. No. 3.6e-124; 

Matches 284; Conservative 50; Mismatches 69; Indels 10; Gaps 5 
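
The Best Local Similarity figure appears to be the number of identical positions divided by the total number of alignment columns (matches, conservative substitutions, mismatches and indels). The sketch below assumes that definition, which is inferred from the numbers in this report rather than from GenCore documentation; the function name is hypothetical:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percentage of identical columns in a local alignment.

    Assumption: every alignment column is counted exactly once as a match,
    a conservative substitution, a mismatch, or an indel.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# Counts from the statistics line directly above
print(best_local_similarity(284, 50, 69, 10))
```

With the counts above (284, 50, 69, 10) this gives 68.8, matching the reported value; the same rule gives 96.6 for the counts 366, 2, 9, 2.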

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNH 75 

: : I I I I I : I I I I I I I M : II : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83

Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135

I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I : I II : I I I I I I I I I II I I I : I I 
Db 84 HMRTWNYFIWLSLADVLWITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195 

I I I II I I I I I I I I I I I : I I I I I : I I I I I : I I I I I I : I I I I I I I I : I I I I I : 
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIVIIWIVSCIIMIPQAIVMECSTVFPGLANK 203

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I I I : I I I I I I : : I I I : I I I I I : I I I : I I I II Ml I I I I I I I I I ! I I I : I : 
Db 204 TTLFTVCDERWGGEIYPKMYHICFFLVTYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 263

Qy 256 RNWK--RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLP 313

III : I I : I : I I I I : I I : I I I I I I I : I I I I I I I I I I : I I I I 

Db 264 RKWKPLQPVSQ PRGPGQPTKSRMSAVAAEIKQIRARRKTARMLMWLLVFAICYLP 319 

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373

I I : I I I I I I I I I I I III III I I I I I I I I I I I I I I I II I I II I I I I I I I : I I I I I 

Db 320 ISILNVLKRVFGMFAHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAF 379

Qy 374 SCCLPGLGPCGSLKAPSPRSSA-SHKSLSLQ--SRCSVSKISEHWLTSVTTV 423

Mil: : I M I MM I : : : I I : I I I I I M : M : 

Db 380 SCCCLGVHHRQEDRLTRGRTSTESRKSLTTQISNFDNISKLSEQWLTSISTL 432 



RESULT 14 
US-09-119-788-2 

Sequence 2, Application US/09119788 
Patent No. 6166193 
GENERAL INFORMATION: 

APPLICANT: Yanagisawa, Masashi 

TITLE OF INVENTION: CDNA CLONE MY1 THAT ENCODES 
TITLE OF INVENTION: A NOVEL HUMAN 7-TRANSMEMBRANE RECEPTOR



; NUMBER OF SEQUENCES: 2 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: SmithKline Beecham Corporation 

STREET: 709 Swedeland Road 
CITY: King of Prussia 
STATE: PA 

COUNTRY: United States of America 
ZIP: 19406 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/119,788 

FILING DATE: 21-JUL-1998 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/053,790 
; FILING DATE: 25-JUL-1997 

ATTORNEY/AGENT INFORMATION: 

NAME: King, William T 

REGISTRATION NUMBER: 30,954 

REFERENCE/DOCKET NUMBER: GH50029 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 610-270-5515 

TELEFAX: 610-270-5090 

TELEX:

; INFORMATION FOR SEQ ID NO: 2: 

; SEQUENCE CHARACTERISTICS: 

; LENGTH: 444 amino acids

; TYPE: amino acid 

; STRANDEDNESS: single 

; TOPOLOGY: linear 

; MOLECULE TYPE: protein 

US-09-119-788-2 

Query Match 65.8%; Score 1459; DB 3; Length 444; 

Best Local Similarity 68.8%; Pred. No. 4.4e-124; 

Matches 284; Conservative 50; Mismatches 69; Indels 10; Gaps 5 
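
Each hit carries the same three-line statistics block, so it can be read mechanically. The following is a sketch only: the regular expression and the `parse_stats` name are assumptions made for illustration, and the pattern expects the field labels exactly as they are printed in this report.

```python
import re

# Field labels as they appear in the per-hit statistics lines of this report.
STAT_RE = re.compile(
    r"(Query Match|Score|DB|Length|Best Local Similarity|Pred\. No\.|"
    r"Matches|Conservative|Mismatches|Indels|Gaps)\s+([0-9.eE+\-]+)%?"
)


def parse_stats(block):
    """Return a dict mapping each statistics label to its numeric value."""
    return {key: float(value) for key, value in STAT_RE.findall(block)}


example = (
    "Query Match 65.8%; Score 1459; DB 3; Length 444;\n"
    "Best Local Similarity 68.8%; Pred. No. 4.4e-124;\n"
    "Matches 284; Conservative 50; Mismatches 69; Indels 10; Gaps 5\n"
)
stats = parse_stats(example)
print(stats["Score"], stats["Pred. No."], stats["Gaps"])
```

All values are returned as floats for simplicity; counts such as Matches or Gaps can be converted to integers by the caller.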

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNH 75 

: : I I I I I : I I I I I I I I I : I I : I I : I I I I I II I : I I I I I I : I I I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83 

Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135

Db 84

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195

III II II II II I M II • I II II • I II I I - II II 11*111 I I I I I - I I I I I •
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIVIIWIVSCIIMIPQAIVMECSTVFPGLANK 203

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255

Db 204 TTLFTVCDERWGGEIYPKMYHICFFLVTYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 263



Qy 256 RNWK--RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLP 313

III : I I : I : I I I I : I I : I I I I I I I : II I I I I I I I I : I I I I 

Db 264 RKWKPLQPVSQ PRGPGQPTKSRMGAVAAEIKQIRARRKTARMLMWLLVFAICYLP 319

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373 

I I : I I I I I II I I II III III II I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 

Db 320 ISILNVLKRVFGMFAHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAF 379

Qy 374 SCCLPGLGPCGSLKAPSPRSSA-SHKSLSLQ--SRCSVSKISEHWLTSVTTV 423

III I : : I : I I I I I : I : : : I I : I I I I I I I : : I : 

Db 380 SCCCLGVHHRQEDRLTRGRTSTESRKSLTTQISNFDNISKLSEQWLTSISTL 432 



RESULT 15 
US-08-513-974B-54 

Sequence 54, Application US/08513974B 
Patent No. 6114139 
GENERAL INFORMATION: 

APPLICANT: Hinuma, Shuji 
APPLICANT: Hosoya, Masaki 
APPLICANT: Fujii, Ryo 
APPLICANT: Ohtaki, Tetsuya 
APPLICANT: Fukusumi, Shoji 
APPLICANT: Ohgi, Kazuhiro 

TITLE OF INVENTION: G PROTEIN COUPLED RECEPTOR PROTEIN, 
TITLE OF INVENTION: PRODUCTION, AND USE THEREOF 
NUMBER OF SEQUENCES: 380 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: DIKE, BRONSTEIN, ROBERTS & CUSHMAN, LLP 
STREET: 130 Water Street 
CITY: Boston 
STATE: MA 
COUNTRY: USA 
ZIP: 02109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/513,974B
FILING DATE: 14-SEP-1995 
CLASSIFICATION: 536 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/JP95/01599
FILING DATE: 10-AUG-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-093989 
FILING DATE: 19-AUG-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-057186 
FILING DATE: 16-MAR-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 7-007177 
FILING DATE: 20-JAN-1995 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-326611 



FILING DATE: 28-DEC-1994
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-270017 
FILING DATE: 02-NOV-1994 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-236357 
FILING DATE: 30-SEP-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-236356 
FILING DATE: 30-SEP-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-189274 
; FILING DATE: 11-AUG-1994

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-189273 
FILING DATE: 11-AUG-1994
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 6-189272 
FILING DATE: 11-AUG-1994
ATTORNEY/AGENT INFORMATION: 
NAME: Resnick, David S. 
REGISTRATION NUMBER: 34,235 
REFERENCE/DOCKET NUMBER: 45753 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617-523-3400 
TELEFAX: 617-523-6440 
; INFORMATION FOR SEQ ID NO: 54: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 263 amino acids
; TYPE: amino acid 

STRANDEDNESS: 
TOPOLOGY: linear 
; MOLECULE TYPE: peptide 
US-08-513-974B-54 



Query Match 59.1%; Score 1311; DB 3; Length 263; 

Best Local Similarity 96.6%; Pred. No. 5.8e-111;

Matches 254; Conservative 2; Mismatches 7; Indels 0; Gaps 0 

Qy 91 ADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICH 150

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | I | | | | I I I I I I I I M I 
Db 1 ADVLVTAICLPASLLV^ITESWLFGHALCKVIPYLQAVSVSVVVLTLSSIALDRWYAICH 60 

Qy 151 PLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDERWADDL 210 

I I I I I I I I I I I I I II I I I I I I I I I : I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 PLLFKSTARRARGSILGIWAVSLAVMVPQAAWECSSVLPEIJ^RVKLLSVCDERWADDL 120

Qy 211 YPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQ 270 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 121 YPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLDDQGQ 180

Qy 271 GLSGEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLPISVLNVLKRVFGMFRQA 330 

Ml I I I M I I I I I I I I I I I I I I I I | I | | | I | | | I | M I I I I I I I I I I I I I I I I I I I I I I 
Db 181 GLSSEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLPISVLNVLKRVFGMFRQA 240 



Qy 331 SDREAVYACFTFSHWLVYANSAA 353

I II I I : I I I I I I I I I I I I I I I II
Db 241 SDREAIYACFTFSHWLVYANSAA 263

Search completed: September 28, 2004, 10:01:35 
Job time : 36 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on:          September 28, 2004, 09:40:21 ; Search time 40 Seconds
                                               (without alignments)
                                               1022.035 Million cell updates/sec

Title:           US-10-070-532-2
Perfect score:   2219
Sequence:        1 MEPSATPGAQMGVPPGSREP...CSVSKISEHWLTSVTTVLP 425

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.5

Searched:        283366 seqs, 96191526 residues

Total number of hits satisfying chosen parameters:  283366

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database:        PIR_78:*
                 pir1:*
                 pir2:*
                 pir3:*
                 pir4:*


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
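
The run parameters listed for this search (sw model, BLOSUM62, Gapop 10.0, Gapext 0.5) describe a Smith-Waterman local alignment with affine gap penalties. The sketch below illustrates that recurrence in plain Python. It is not GenCore's implementation: the substitution scores come from a caller-supplied function (a toy match/mismatch scorer is used here instead of BLOSUM62), and it assumes that a gap of length k costs gap_open + (k - 1) * gap_extend, which may differ from the convention GenCore uses.

```python
def smith_waterman_affine(a, b, score, gap_open=10.0, gap_extend=0.5):
    """Best local alignment score of a and b with affine gap penalties.

    score(x, y) returns the substitution score for residues x and y.
    Assumed gap cost: gap_open for the first gap position, gap_extend
    for each additional position in the same gap.
    """
    neg_inf = float("-inf")
    n, m = len(a), len(b)
    # H: best score ending at (i, j); E/F: best score ending in a gap.
    H = [[0.0] * (m + 1) for _ in range(n + 1)]
    E = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    F = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Open a new gap or extend an existing one.
            E[i][j] = max(H[i][j - 1] - gap_open, E[i][j - 1] - gap_extend)
            F[i][j] = max(H[i - 1][j] - gap_open, F[i - 1][j] - gap_extend)
            diagonal = H[i - 1][j - 1] + score(a[i - 1], b[j - 1])
            # Local alignment: scores never drop below zero.
            H[i][j] = max(0.0, diagonal, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


def toy_score(x, y):
    """Illustrative scorer (not BLOSUM62): +5 for identity, -4 otherwise."""
    return 5.0 if x == y else -4.0


print(smith_waterman_affine("AAAAATTTTT", "AAAAAGGTTTTT", toy_score))
```

In the example, the ten matching residues contribute 50 and the two-residue gap costs 10.0 + 0.5, so the best local score is 39.5. Replacing toy_score with a BLOSUM62 lookup would bring the sketch closer to the settings printed in the run header.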

SUMMARIES 



Result            Query
   No.   Score    Match  Length  DB  ID        Description

     1   464.5     20.9     427   2  S50150    gastric CCK-A rece
     2   447.5     20.2     449   2  A41738    neuropeptide Y rec
     3     434     19.6     381   2  139187    neuropeptide Y/pep
     4   432.5     19.5     370   1  152315    G protein-coupled
     5   428.5     19.3     428   2  JN0692    cholecystokinin ty
     6   421.5     19.0     444   2  A42685    cholecystokinin re
     7     419     18.9     407   2  S23510    neurokinin 1 recep
     8     418     18.8     407   2  A34357    neurokinin 1 recep
     9   417.5     18.8     519   2  S17783    tachykinin recepto
    10     417     18.8     407   1  JQ1274    neurokinin 1 recep
    11     417     18.8     407   2  S20304    neurokinin 1 recep
    12     417     18.8     430   2  151898    cholecystokinin A
    13     416     18.7     465   1  JQ1517    neurokinin 3 recep
    14   414.5   1 o lo . 1   /ICO   Z  A J 4 y 1 b    neurokinin 3 recep
    15   413.5     18.6     402   2  156595    neurokinin 2 recep
    16     413     18.6     436   2  JC5599    cholecystokinin-A
    17   409.5   18 . r: D     452   Z  JC2459    gastrin/cholecysto
    18   408.5     18.4     385   2  S55524    neurokinin 3 recep
    19   406.5     18.3     450   2  JQ1614    gastrin receptor -
    20   a do c; 4 (Jo . 0   z   1 Q Q   1            neurokinin 2 recep
    21   403.5     18.2     452   2  A46195    cholecystokinin B
    22     403   1 o lo . Z   O O A 3o4   1  SU 051 6    neurokinin 2 recep
    23     401     18.1     584   2  JC7809    sulfakinin recepto
    24   400.5     18.0     423   2  B40470    glucocorticoid-ind
    25   400.5     18.0     440   2  A44081    kappa-type opioid
    26     399     18.0     447   2  A47430    gastrin/cholecysto
    27     398     17.9     349   2  159336    galanin receptor 1
    28     397     17.9     384   2  157957    neurokinin 2 recep
    29     396     17.8     390   2  A36737    neurokinin 2 recep
    30     394     17.8     453   2  S32817    gastrin receptor -
    31   393.5     17.7     423   2  JC7677    allatostatin recep
    32     389     17.5     504   2  A41783    tachykinin recepto
    33   385.5     17.4     399   2  S29480    bombesin receptor
    34     384     17.3     366   2  S71152    neuropeptide Y/pep
    35   383.5     17.3     384   2  S20303    neurokinin 2 recep
    36   382.5     17.2     443   2  D40470    glucocorticoid-ind
    37   381.5     17.2     384   2  A39003    bombesin/gastrin-r
    38     379     17.1     477   1  QRHUB1    beta-1-adrenergic
    39   378.5     17.1     390   2  B41007    bombesin receptor,
    40     378     17.0     394   2  JC7209    galanin receptor -
    41   372.5     16.8     387   2  JC5949    galanin receptor 2
    42   372.5     16.8     480   2  153053    beta 1 adrenergic
    43   368.5     16.6     375   2  S63685    neuropeptide Y rec
    44   368.5     16.6     384   2  157682    bombesin/GRP rece
    45   368.5     16.6     399   2  A46632    bombesin-like pept



ALIGNMENTS 



RESULT 1 
S50150 

gastric CCK-A receptor - rabbit 

C;Species: Oryctolagus cuniculus (domestic rabbit)
C;Date: 14-Jul-1995 #sequence_revision 21-Jul-1995 #text_change 20-Apr-2000
C;Accession: S50150
R;Reuben, M.; Rising, L.; Prinz, C.; Hersey, S.; Sachs, G.
Biochim. Biophys. Acta 1219, 321-327, 1994
A;Title: Cloning and expression of the rabbit gastric CCK-A receptor.
A;Reference number: S50150; MUID:95002144; PMID:7918628
A;Accession: S50150
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-427 <REU>
C;Superfamily: neurokinin 1 receptor



Query Match 20.9%; Score 464.5; DB 2; Length 427;

Best Local Similarity 29.6%; Pred. No. 8e-32;

Matches 128; Conservative 85; Mismatches 168; Indels 51; Gaps 13;



Qy 8 GAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEW VL I AAYVAVFWAL VGN 64 

I I : I I I :: | | I : I II I I I : | | | 

Db 9 GNASGIPP PCELGLDNETLFCLDQP PPSKEWQPAVQILLYSLIFLLSVLGN 59 

Qy 65 TLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPY 124

III : I I | | | | | | I : : : I : : : I : : : I : I : I :::::: I I I I I I I 
Db 60 TLVITVLIRNKRMRTVTNIFLLSLAISDLMLCLFCMPFNLIPNLLKDFIFGSALCKTTTY 119 

Qy 125 LQAVSVSVAVLTLSFIALDRWYAICHPLLFK--STARRARGSILGIWAVSLAIMVPQAAV 182

I I I I I : I I I : I : I : I I I I I : I I I I : I I II I 
Db 120 LMGTSVSVSTLNLVAISLERYGAICKPLQSRWQTKSHALKVIAATWCLSFAIMTPYPIY 179 

Qy 183 MECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKL 242 

I : : : I : : : I I : : : I : : : : I I : I : I I I : I 

Db 180 SNLVPFTKTNNQTANMCRFLLPSDVMQQAWHTFLLLILFLIPGIVMMVAYGMISLEL 236 

Qy 243 W-GRQIPGTTSALVRNWK RPSDQLGDLEQGLSGEPQPRARA 282 

s I : = : I = I : I I : I : I I I I I 

Db 237 YQGIKFDASQKKSAKERKASTGSGRFEDNDGCYLQRSKPTRQL-ELQQ-LSGGGGGRVSR 294

Qy 283 F--LAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACF 340

: : I : : : : I I I I : : : : I I I : : I I I : : : : : I 
Db 295 I HS S S S AAALMAKKRVI RMLMVI WLFFLCWMP I FSANAWRAYDTV SAERRLSGTPI 351 

Qy 341 TFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSL 400 

: I I I : I I I I I I I : : : I I I I I I I I II I I I : I : 

Db 352 SFILLLSYTSSCVNPIIYCFMNKRFRLGFMATFPCC-PNPGP PGPRAEAGEEEE 404 

Qy 401 SLQSRCSVSKIS 412 

: I 1:1: I 
Db 405 GRTTRASLSRYS 416 



RESULT 2 
A41738 

neuropeptide Y receptor - fruit fly (Drosophila melanogaster)
N;Alternate names: G protein-coupled receptor PR4
C;Species: Drosophila melanogaster
C;Date: 16-Sep-1992 #sequence_revision 16-Sep-1992 #text_change 20-Apr-2000
C;Accession: A41738
R;Li, X.J.; Wu, Y.N.; North, R.A.; Forte, M.
J. Biol. Chem. 267, 9-12, 1992
A;Title: Cloning, functional expression, and developmental regulation of a neuropeptide Y receptor from Drosophila melanogaster.
A;Reference number: A41738; MUID:92112730; PMID:1370455
A;Accession: A41738
A;Molecule type: mRNA
A;Residues: 1-449 <LIA>
A;Cross-references: GB:M81490; NID:g157996; PIDN:AAA28727.1; PID:g157997
C;Genetics:
A;Gene: FlyBase:NepYr
A;Cross-references: FlyBase:FBgn0004842
C;Superfamily: neurokinin 1 receptor
C;Keywords: appetite; G protein-coupled receptor; transmembrane protein



Query Match 20.2%; Score 447.5; DB 2; Length 449;

Best Local Similarity 29.4%; Pred. No. 2.4e-30;

Matches 109; Conservative 63; Mismatches 138; Indels 61; Gaps 9;



Qy 26 DYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFI 85

II: I I : : I : : I : I I : I I | | | : I I I I I I I I I 

Db 75 DYD LLSEDMWSSAYFKIIVYMLYIPIFIFALIGNGTVCYIVYSTPRMRTVTNYFI 129 

Qy 86 VNLSLADVLVTAICLPASLL-VDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDR 144

: I : : I : I : : I I : I : : I I II I I I : I I I I I I I : II I :: I I 
Db 130 ASLAIGDILMSFFCEPSSFISLFILNYWPFGLALCHFVNYSQAVSVLVSAYTLVAISIDR 189 

Qy 145 WYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDE 204 

: I I I I : I I I I I : I : : I I : I I : : : : I I 

Db 190 YIAIMWPLKPRITKRYATFIIAGVWFIALATALPIPIVSGLDIPMSPWHTKCEKY-ICRE 248

Qy 205 RWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQ 264 

I I | : : : | I I : : I : I : : I : : I I : II 

Db 249 MWPSRSQEYYYTLSLFALQFWPLGVLIFTYARITIRVWAKRPPGEA ETNRDQ 301 

Qy 265 LGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLPISVLNVLKRVF 324 

: I : : I I I I : : I : : I I I : I I : : I : I 
Db 302 RMARS KRKMVKMMLTWI VFTCCWLP FN I LQLL 334 

Qy 325 GMFRQASDREAV YACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFS 374 

: I I I I I I I I : : I I I I I : : : : I I I 

Db 335 LNDEEFAHWDPLPYVWFAF-HWLAMSHCCYNPIIYCYMNARFRSGFVQLMHRMPG 388 

Qy 375 CCLPGLG 381 

III : I 

Db 389 LRRWCCLRSVG 399 



RESULT 3 
139187 

neuropeptide Y/peptide YY receptor Y2 - human
N;Alternate names: neuropeptide Y/peptide YY receptor type 2
C;Species: Homo sapiens (man)
C;Date: 01-Mar-1996 #sequence_revision 01-Mar-1996 #text_change 20-Apr-2000
C;Accession: 139187; 139163; G02301
R;Gerald, C.; Walker, M.W.; Vaysse, P.J.
J. Biol. Chem. 270, 26758-26761, 1995
A;Title: Expression cloning and pharmacological characterization of a human hippocampal neuropeptide Y/peptide YY Y2 receptor subtype.
A;Reference number: 139187; MUID:96070760; PMID:7592910
A;Accession: 139187
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-381 <GER>
A;Cross-references: EMBL:U36269; NID:g1063633; PIDN:AAC50281.1; PID:g1063634
R;Rose, P.M.; Fernandes, P.; Lynch, J.S.; Frazier, S.T.; Fisher, S.M.; Kodukul
K.; Kienzle, B.; Seethala, R.
J. Biol. Chem. 270, 22661-22664, 1995
A;Title: Cloning and functional expression of a cDNA encoding a human type 2 neuropeptide Y receptor.
A;Reference number: 139163; MUID:96032678; PMID:7559383
A;Accession: 139163
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-133,'A',135-381 <ROS>
A;Cross-references: EMBL:U32500; NID:g1000750; PIDN:AAA93170.1; PID:g1000751
R;Yan, H.; Yang, J.; Marasco, J.; Yamaguchi, K.; Brenner, S.; Collins, F.; Karbon, W.
submitted to the EMBL Data Library, December 1995
A;Reference number: H01019
A;Accession: G02301
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-171,'G',173,'R',175-201,'P',203-208,'A',210-381 <YAN>
A;Cross-references: EMBL:U42389; NID:g1314329; PIDN:AAB07760.1; PID:g1314330
C;Genetics:
A;Gene: GDB:NPY2R
A;Cross-references: GDB:4365607; OMIM:162642
A;Map position: 4q31-4q31
C;Superfamily: neurokinin 1 receptor
C;Keywords: appetite; G protein-coupled receptor; glycoprotein; lipoprotein; thiolester bond; transmembrane protein
F;49-76/Domain: transmembrane #status predicted <TM1>
F;87-113/Domain: transmembrane #status predicted <TM2>
F;166-186/Domain: transmembrane #status predicted <TM4>
F;221-237/Domain: transmembrane #status predicted <TM5>
F;269-291/Domain: transmembrane #status predicted <TM6>
F;305-328/Domain: transmembrane #status predicted <TM7>
F;123-203/Disulfide bonds: #status predicted
F;342/Binding site: palmitate (Cys) (covalent) #status predicted
F;372/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 19.6%; Score 434; DB 2; Length 381;

Best Local Similarity 27.6%; Pred. No. 2.8e-29;

Matches 112; Conservative 74; Mismatches 140; Indels 80; Gaps
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Qy 


3 


P SAT P GAQMGVP P G S RE P S P VP P D YEDE FLRYLWRD YL Y P KQ YEWVL I AAYVAVFWALV 
ill:: MM: | : : | | | | | : : : : : : 
PQTTPRGEL VPDPEPELI D S T KL I EVQWL I LAYC S 1 1 L LGVI 


62 


Db 


24 


66 


Qy 63 GNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVI 122

1 1 : 1 1 1 : 1 1 1 I 1 1 : 1 1 1 1 :: 1 1 : 1 1 : 1 1 1 : 1 : 1 1 II::
Db 67 GNSLVIHWIKFKSMRTVTNFFIANLAVADLLVNTLCLPFTLTYTLMGEWKMGPVLCHLV 126

Qy 123 PYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGI-WAVSLAIMVPQAA 181
1 1 1 : : 1 1 : : 1 1 : 1 1 1 1 1 1 : 1 1 : : 1 I : I : I : I : II
Db 127 PYAQGLAVQVSTITLTVIALDRHRCIVYHLESK-ISKRISFLIIGLAWGISALLASPLAI 185

Qy 182 VMECS--SVLPELANRTRLFSVCDERWADD LYPKIYHSCFFI VTYLAPLGLMAMAYF 236

II : : I : 1 1 : 1 z : 1 : 1 : : | : | | | : : : : |
Db 186 FREYSLIEIIPDFE IVACTEKWPGEEKSIYGTVYSLSSLLILYVLPLGIISFSYT 240

Qy 237 QIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKT 296

: 1 : II II 1 : 1 : 1 1
Db 241 RIWSKLKNHVSPGA ANDHYHQRRQKT 266

Qy 297 AKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPI 356

1 1 1 : 1 : : 1 1 1 : : 1 1 1 : : : 1 1 : II 1 = : : M 1 :

Db 267 TKMLVCWWFAVSWLP LHAFQLAVDIDSQVLDLKEYKLIFTVFHIIAMCSTFANPL 323

Qy 357 IYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402

. I : ; ; • | • I • i i i i ii • i •

Db 324 LYGWMNSNYRKAFLSAFRC EQRLDAIHSEVSV 355

RESULT 4 
I52315

G protein-coupled receptor UHR-1 - rat
C;Species: Rattus sp. (rat)

C;Date: 10-Sep-1999 #sequence_revision 10-Sep-1999 #text_change 19-May-2000
C;Accession: I52315

R;Welch, S.K.; O'Hara, B.F.; Kilduff, T.S.; Heller, H.C.
Biochem. Biophys. Res. Commun. 209, 606-613, 1995

A;Title: Sequence and tissue distribution of a candidate G-coupled receptor
cloned from rat hypothalamus.

A;Reference number: I52315; MUID:95251659; PMID:7733930
A;Accession: I52315

A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-370 <RES>

A;Cross-references: GB:S77867; NID:g998527; PIDN:AAB34129.1; PID:g998528
C;Superfamily: neurokinin 1 receptor

Query Match 19.5%; Score 432.5; DB 1; Length 370;

Best Local Similarity 32.2%; Pred. No. 3.7e-29; 

Matches 106; Conservative 66; Mismatches 116; Indels 41; Gaps 7; 



Qy 44 QYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPAS 103
I : : : : 1 1 1 1 1 1 1 1 1 : 1 : 1 : 1 II : 1 1 1 : 1 : 1 1 1 : 1 1 : 1 :
Db 58 QLKGLIVMLYSIWWGLVGNCLLVLVIARVRRLHNVTNFLIGNLALSDVLMCAACVPLT 117

Qy 104 LLVDI-TESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRAR 162
1 I : 1 1 1 1 : : : 1 1 1 : 1 1 : 1 1 1 : II : 1 1 : : 1 1 1 : : : :
Db 118 IA.YAFEPRGWVFGGGLCHLVFFLQPVTVYVSVFTLTTIAVDRYWLVHPLRRRISLKLSA 177

Qy 163 GSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDERW-ADDLYPKIYHSCFFI 221
: : II I t 1 : 1 : : 1 1 : 1 : II 1 : 1 1 : : : 1 1 :
Db 178 YAVL G I WAL S AVLAL P AAVHT YH VE L K P HDVRL -CEEFWGSQERQRQIYAWGLLL 231

Qy 222 VTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRAR 281
1 1 1 1 1 : : : 1 : : 1 1 1 : II : : : 1 III
Db 232 GTYLLPLLAILLSYVRVSVKLRNRWPGSVTQSQ7VDW DRAR 272

Qy 282 AFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFT 341
II : 1 : | : | | : : | | | | 1 : 1 1 : : 1 : 1 : : : 1 1 1
Db 273 RRRTFCLLVVVVWFALCWLPLHIFNLLR DLDPRAI DPYAFGLVQL 318

Qy 342 FSHWLVYANSAANPIIYNFLSGKFREQFK 370
III : : : II II : 1 III::
Db 319 LCHWLAMSSACYNPFIYAWLHDSFREELR 347





RESULT 5 
JN0692 

cholecystokinin type A receptor - human 
C;Species: Homo sapiens (man)

C;Date: 03-Feb-1994 #sequence_revision 03-Feb-1994 #text_change 20-Apr-2000
C;Accession: JN0692; JN0590

R;de Weerth, A.; Pisegna, J.R.; Huppi, K.; Wank, S.A.
Biochem. Biophys. Res. Commun. 194, 811-818, 1993

A;Title: Molecular cloning, functional expression and chromosomal localization
of the human cholecystokinin type A receptor.

A;Reference number: JN0692; MUID:93343941; PMID:8343165
A;Accession: JN0692
A;Molecule type: mRNA
A;Residues: 1-428 <DEW>

A;Cross-references: GB:L19315; NID:g306595; PIDN:AAA02819.1; PID:g306596
A;Experimental source: gallbladder

R;Ulrich, C.D.; Ferber, I.; Holicky, E.; Hadac, E.; Buell, G.; Miller, L.J.
Biochem. Biophys. Res. Commun. 193, 204-211, 1993

A;Title: Molecular cloning and functional expression of the human gallbladder
cholecystokinin A receptor.

A;Reference number: JN0590; MUID:93277552; PMID:8503909
A;Accession: JN0590

A;Status: nucleic acid sequence not shown
A;Molecule type: DNA
A;Residues: 1-428 <ULR>

A;Cross-references: GB:L13605; NID:g306490; PIDN:AAA35659.1; PID:g306491
A;Experimental source: gallbladder

C;Comment: This protein has diverse physiological roles in the gastrointestinal
system where it mediates pancreatic growth and enzyme secretion, smooth muscle
contraction of the gallbladder and stomach, and secretion from gastric mucosal
cells.

C;Genetics:
A;Gene: GDB:CCKAR
A;Cross-references: GDB:141927; OMIM:118444
A;Map position: 4pter-4qter
C;Superfamily: neurokinin 1 receptor

C;Keywords: G protein-coupled receptor; glycoprotein; hormone receptor;
phosphoprotein; transmembrane protein

F;40-67/Domain: transmembrane #status predicted <TM1>
F;78-104/Domain: transmembrane #status predicted <TM2>
F;116-137/Domain: transmembrane #status predicted <TM3>
F;158-178/Domain: transmembrane #status predicted <TM4>
F;208-234/Domain: transmembrane #status predicted <TM5>
F;314-332/Domain: transmembrane #status predicted <TM6>
F;350-369/Domain: transmembrane #status predicted <TM7>

F;10,24,190,299/Binding site: carbohydrate (Asn) (covalent) #status predicted
F;245,249,260,290/Binding site: phosphate (Ser) (covalent) (by protein kinase C)
#status predicted

F;256/Binding site: phosphate (Ser) (covalent) (by protein kinase A) #status
predicted

Query Match 19.3%; Score 428.5; DB 2; Length 428;

Best Local Similarity 29.2%; Pred. No. 9.4e-29; 

Matches 125; Conservative 82; Mismatches 180; Indels 41; Gaps 13; 

Qy 16 GSREPSPVPPDYEDEFLRYLWRDYLYPKQYEW VLIAAYVAVFVVALVGNTLVCLAVW 72

I I 1 I : I I I I I : I I II I : I : : : : : I I I I I : 

Db 11 GSNITPPCELGLENETLFCL--DQPRPSK-EWQPAVQILLYSLIFLLSVLGNTLVITVLI 67

Qy 73 RNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSV 132

II I I I | M | : : : | : : : I : : : I : I : I :::::: I I I : I I I I I II 

Db 68 RNKRMRTVTNIFLLSLAVSDLMLCLFCMPFNLIPNLLKDFIFGSAVCKTTTYFMGTSVSV 127



Qy 133 AVLTLSFIALDRWYAICHPLLFK--STARRARGSILGIWAVSLAIMVPQAAVMECSSVLP 190

: I I : I : I : I I I I I : I I I I : I ill I : : : I 

Db 128 STFNLVAISLERYGAICKPLQSRVWQTKSHALKVIAATWCLSFTIMTPYPIY SNLVP 184 

Qy 191 ELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLW 243

| : : : I : | : : : I : : : : I I : I : I I I : I : 

Db 185 FTKNNNQTANMCRFLLPNDVMQQSWHTFLLLILFLIPGIVMMVAYGLISLELYQGIKFEA 244 

Qy 244 GRQIPGTTSA LVRNWKRPSD-QLGDLEQGLSGEPQPRARAFLAEV 287 

: I I I I : : : : I : I I I I I I : : 

Db 245 SQKKSAKERKPSTTSSGKYEDSDGCYLQKTRPPRKLELRQLSTGSSSRAN-RIRS-NSSA 302 

Qy 288 KQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLV 347

: I : : : : I I : I : : : : I I I : : I I I : : : : I : I I 

Db 303 ANLMAKKRVIRMLIVIWLFFLCWMPIFSANAWR AYDTASAERRLS GT P I S FI LLLS 359 

Qy 348 YANSAANPIIYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCS 407 

1:1 I I I M I : : : I I I I I I I I II I : I I I I I 

Db 360 YTSSCVNPIIYCFMNKRFRLGFMATFPCC-PNPGPPGARGEVGEEEEGGTTGASL-SRFS 417 

Qy 408 VSKISEHV 415 

I : I I 

Db 418 YSHMSASV 425



RESULT 6 
A42685 

cholecystokinin receptor type A - rat 

C;Species: Rattus norvegicus (Norway rat)

C;Date: 04-Mar-1993 #sequence_revision 18-Nov-1994 #text_change 20-Apr-2000
C;Accession: A42685; JC4225; PC2213

R;Wank, S.A.; Harkins, R.; Jensen, R.T.; Shapira, H.; de Weerth, A.; Slattery,
T.
Proc. Natl. Acad. Sci. U.S.A. 89, 3125-3129, 1992

A;Title: Purification, molecular cloning, and functional expression of the
cholecystokinin receptor from rat pancreas.

A;Reference number: A42685; MUID:92212981; PMID:1313582
A;Accession: A42685
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-444 <WAN>

A;Cross-references: GB:M88096; NID:g203383; PIDN:AAA40899.1; PID:g203384
A;Experimental source: pancreas
A;Note: sequence extracted from NCBI backbone (NCBIN:93814, NCBIP:93815)

R;Takata, Y.; Takiguchi, S.; Funakoshi, A.; Kono, A.
Biochem. Biophys. Res. Commun. 213, 958-966, 1995

A;Title: Gene structure of rat cholecystokinin type-A receptor.
A;Reference number: JC4225; MUID:95382845; PMID:7654260
A;Accession: JC4225
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-223 <TAK>

A;Cross-references: DDBJ:D50608; NID:g1100752
R;Mantamadiotis, T.; Baldwin, G.S.
Biochem. Biophys. Res. Commun. 201, 1382-1389, 1994

A;Title: The seventh transmembrane domain of gastrin/CCK receptors contributes
to non-peptide antagonist binding.

A;Reference number: PC2213; MUID:94296413; PMID:8024583
A;Accession: PC2213
A;Status: preliminary
A;Molecule type: protein
A;Residues: 366-389 <MAN>

C;Comment: This G-protein-coupled receptor is present in the gastrointestinal
system, vagus nerve and localized areas of the central nervous system. It
mediates pancreatic growth and enzyme secretion, smooth muscle contraction of
the gallbladder and stomach. It is capable of activating phospholipase C and
stimulating transduction by increasing levels of diacylglycerol, inositol
phosphate, and inducing the subsequent release of intracellular calcium.
C;Genetics:
A;Gene: CCKAR
A;Introns: 53/1; 137/1; 224/2; 267/1
C;Superfamily: neurokinin 1 receptor

C;Keywords: G protein-coupled receptor; hormone receptor; transmembrane protein

F;57-82/Domain: transmembrane #status predicted <TM1>
F;93-119/Domain: transmembrane #status predicted <TM2>
F;131-151/Domain: transmembrane #status predicted <TM3>
F;173-193/Domain: transmembrane #status predicted <TM4>
F;225-249/Domain: transmembrane #status predicted <TM5>
F;330-348/Domain: transmembrane #status predicted <TM6>
F;366-389/Domain: transmembrane #status predicted <TM7>

Query Match 19.0%; Score 421.5; DB 2; Length 444;

Best Local Similarity 28.1%; Pred. No. 3.9e-28; 

Matches 120; Conservative 83; Mismatches 171; Indels 53; Gaps 12; 



Qy 16 GSREPSPVPPDYEDEFLRYLWRDYLYPKQYEW VLIAAYVAVFVVALVGNTLVCLAVW 72
II I 1 : 1 1 1 1 1 : 1 1 : 1 1 : | : : : : : I I I I I :
Db 26 GSNITPPCELGLENETLFCL--DQPQPSK-EWQSALQILLYSIIFLLSVLGNTLVITVLI 82

Qy 73 RNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSV 132
II | I | I | I I :: : 1 : : : 1 : : : 1 : 1 : 1 : : : : : : 1 1 1 : 1 1 1 1 ! 1 1
Db 83 RNKRMRTVTNIFLLSLAVSDLMLCLFCMPFNLIPNLLKDFIFGSAVCKTTTYFMGTSVSV 142

Qy 133 AVLTLSFIALDRWYAICHPLLFK--STARRARGSILGIWAVSLAIMVPQAAVMECSSVLP 190
: I 1 : 1 : 1 : 1 1 1 1 1 : 1 1 1 1 : 1 III 1 :: : 1
Db 143 STFNLVAISLERYGAICRPLQSRVWQTKSHALKVIAATWCLSFTIMTPYPIY SNLVP 199

Qy 191 ELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLW-GRQIPG 249
I : : : | 1 : : : : : : 1 1 : 1 : 1 1 1 : 1 : 1 :
Db 200 FTKNNNQTANMCRFLLPSDAMQQSWQTFLLLILFLLPGIVMWAYGLISLELYQGIKFDA 259

Qy 250 TTSALVRNWKRPSD- QLGDLEQGLSGEPQPRARAFLAE 286
: : 1 : 1 1 : 1 1 1 1 I- h :
Db 260 SQKKSAKE-KKPSTGSSTRYEDSDGCYLQKSRPPRKLELQQLSSGSGGSRLNRIRS-SSS 317

Qy 287 VKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWL 346
: | : : : : | | : | : : : : I I I : : I 1 1 : : : : : : : 1 1
Db 318 AANLIAKKRVI RMLI VI WLFFLCWMPI FSANAWRAYDTV SAEKHLSGTPISFILLL 374

Qy 347 VYANSAANPIIYNFLSGKFREQFKAAFSCCLPGLGPCG SLKAPSPRSS 394
1:1 1 1 1 II 1 : : : 1 1 1 1 1 1 1 1 III : : : 1 II
Db 375 SYTSSCVNPIIYCFMNKRFRLGFMATFPCC-PNPGPPGVRGEVGEEEDGRTIRALLSRYS 433

Qy 395 ASHKSLS 401

Db 434 YSHMSTS 440



RESULT 7 
S23510 

neurokinin 1 receptor - guinea pig 

N;Alternate names: NK-1 receptor; substance P receptor
C;Species: Cavia porcellus (guinea pig)

C;Date: 07-Apr-1994 #sequence_revision 07-Apr-1994 #text_change 19-May-2000
C;Accession: S23510; S19198

R;Gorbulev, V.; Akhundova, A.; Luzius, H.; Fahrenholz, F.
Biochim. Biophys. Acta 1131, 99-102, 1992

A;Title: Molecular cloning of substance P receptor cDNA from guinea-pig uterus.
A;Reference number: S23510; MUID:92256498; PMID:1374648
A;Accession: S23510
A;Molecule type: mRNA
A;Residues: 1-407 <GOR>

A;Cross-references: EMBL:X64323; NID:g49565; PIDN:CAA45608.1; PID:g49566
C;Superfamily: neurokinin 1 receptor

C;Keywords: G protein-coupled receptor; glycoprotein; transmembrane protein

F;32-55/Domain: transmembrane #status predicted <TM1>
F;69-89/Domain: transmembrane #status predicted <TM2>
F;117-128/Domain: transmembrane #status predicted <TM3>
F;149-169/Domain: transmembrane #status predicted <TM4>
F;196-217/Domain: transmembrane #status predicted <TM5>
F;250-280/Domain: transmembrane #status predicted <TM6>
F;289-308/Domain: transmembrane #status predicted <TM7>

Query Match 18.9%; Score 419; DB 2; Length 407; 

Best Local Similarity 27.6%; Pred. No. 5.7e-28; 

Matches 110; Conservative 71; Mismatches 145; Indels 72; Gaps 11; 



Qy 42 PKQY EWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLV 95
||: 1 : : III : 1 : : 1 1 1 : 1 : : 1 1 1 1 1 1 1 1 : 1 1 1 : 1 : :
Db 22 P N Q FVQ P AWQ I VLWAAA YT VI WT S WGNVWMW 1 1 LAH K RMRT VTN Y FL VN LAFAEASM 81

Qy 96 TAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFK 155
1 : : 1 : 1 II : : 1 : : : : : : | I | : t 1 1 1 1 :
Db 82 AAFNTWNFTYAVHNEWYYGLFYCKFHNFFPIAAVFASIYSMTAVAFDRYMAIIHPLQPR 141

Qy 156 STARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDERW ADDLYP 212
: 1 : 1 1 1 : : 1 : 1 1 1 : 1 II 1 1 : 1
Db 142 LSATATKWI CVIWVLALLLAFPQGYY STTETMPGRV VCMIEWPSHPDKI YE 193

Qy 213 KIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGL 272
1:111 : : 1 1 1 : : II : II : 1 1 1 : 1
Db 194 KVYHICVTVLIYFLPLLVIGYAYTWGITLWASEIPGDSSDRYH 237

Qy 273 SGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASD 332
: 1 : 1 : 1 1 1 1 : : II : 1 1 : 1 : 1 1 : : 1 :
Db 238 EQVSAKRKWKMMIVWCTFAICWLPFHIFFLLPYINPDLYLKKF 282

Qy 333 REAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPR 392
: 1 1 II : : : 1 1 1 1 1 1 : : 1 1 II Mill:
Db 283 I QQVYLAIM WLAMS STMYNP 1 1 YCCLNDRFRLGFKHAFRCC- PFI 326

Qy 393 SSASHKSLSL QSRCSVSKISEHVVLTSVTTVL 424

Db 327 SAADYEGLEMKSTRYFQTQGSVYKVSR--LETTISTW 362



RESULT 8 
A34357 

neurokinin 1 receptor - rat 

N;Alternate names: NK-1 receptor; substance P receptor
C;Species: Rattus norvegicus (Norway rat)

C;Date: 22-Jan-1993 #sequence_revision 22-Jan-1993 #text_change 19-May-2000
C;Accession: A38692; A34357; A40089
R;Hershey, A.D.; Dykema, P.E.; Krause, J.E.
J. Biol. Chem. 266, 4366-4374, 1991

A;Title: Organization, structure, and expression of the gene encoding the rat
substance P receptor.

A;Reference number: A38692; MUID:91154239; PMID:1705552
A;Accession: A38692
A;Molecule type: DNA
A;Residues: 1-407 <HER>
A;Cross-references: GB:M34751

R;Yokota, Y.; Sasai, Y.; Tanaka, K.; Fujiwara, T.; Tsuchida, K.; Shigemoto, R.;
Kakizuka, A.; Ohkubo, H.; Nakanishi, S.
J. Biol. Chem. 264, 17649-17652, 1989

A;Title: Molecular characterization of a functional cDNA for rat substance P
receptor.

A;Reference number: A34357; MUID:90036822; PMID:2478537
A;Accession: A34357
A;Molecule type: mRNA
A;Residues: 1-407 <YOK>

A;Cross-references: GB:J05097; NID:g207051; PIDN:AAA42175.1; PID:g207052
R;Hershey, A.D.; Krause, J.E.
Science 247, 958-962, 1990

A;Title: Molecular characterization of a functional cDNA encoding the rat
substance P receptor.

A;Reference number: A40089; MUID:90161991; PMID:2154852
A;Accession: A40089
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-74,'R',76-212,'A',214-407 <HE2>
A;Cross-references: GB:M31477

C;Superfamily: neurokinin 1 receptor

C;Keywords: G protein-coupled receptor; glycoprotein; membrane protein

Query Match 18.8%; Score 418; DB 2; Length 407;

Best Local Similarity 27.6%; Pred. No. 7e-28;

Matches 112; Conservative 69; Mismatches 149; Indels 76; Gaps 8;
Qy 29 DEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIVNL 88

Db 23 NQFVQPTWQIVLW-

Qy 89 SLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAI 148
. i • i • • i*i ii • i ii" ii
Db 75 AFAEACMAAFNTWNFTYAVHNVWYYGLFYCKFHNFFPIAALFASIYSMTAVAFDRYMAI 134

Qy 149 CHPLLFKSTARRARGSILGIWAVSLAIMVPQA- AVMECSSVLPELANRTRLF 199



I I I : : I : | | | : : | : | | : : I I I I I I 
Db 135 IHPLQPRLSATATKWIFVIWVLALLLAFPQGYYSTTETMPSRWCMIEWPEHPNRT 191 

Qy 200 SVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWK 259

I I II I : : I II:: II : II : I M : I 
Db 192 YEKAYHICVTVLIYFLPLLVIGYAYTWGITLWASEIPGDSSDRYH 237 

Qy 260 RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNV 319

: I : I : I I I I : : I I : I I : I : I I I : 
Db 238 EQVSAKRKWKMMIVVVCTFAI CWLPFHVFFL 269 

Qy 320 LKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCC-LP 378

I : : I I III : : : I I I I I I : : I I I I I I I I 

Db 270 LPYINPDLYLKKFIQQVYLA SMWLAMSSTMYNPIIYCCLNDRFRLGFKHAFRCCPFI 326

Qy 379 GLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSVTTVL 424

I I : I I | | : : | | | : | : | : : : | | : 

Db 327 SAGDYEGLEMKSTR YLQTQSSVYKVSR--LETTISTW 362 



RESULT 9 
S17783 

tachykinin receptor homolog DTKR - fruit fly (Drosophila melanogaster) 
C;Species: Drosophila melanogaster

C;Date: 22-Nov-1996 #sequence_revision 22-Nov-1996 #text_change 20-Apr-2000
C;Accession: S17783

R;Li, X.J.; Wolfgang, W.; Wu, Y.N.; North, R.A.; Forte, M.
EMBO J. 10, 3221-3229, 1991

A;Title: Cloning, heterologous expression and developmental regulation of a
Drosophila receptor for tachykinin-like peptides.

A;Reference number: S17783; MUID:92007772; PMID:1717263
A;Accession: S17783
A;Molecule type: mRNA
A;Residues: 1-519 <LIX>

A;Cross-references: EMBL:X62711; NID:g8505; PIDN:CAA44595.1; PID:g8506
A;Note: the sequence from Fig. 2 is inconsistent with that from Fig. 1 in
lacking 481-Gly
C;Genetics:
A;Gene: FlyBase:Takr99D
A;Cross-references: FlyBase:FBgn0004622
C;Superfamily: neurokinin 1 receptor

C;Keywords: G protein-coupled receptor; neurotransmitter receptor; transmembrane
protein

Query Match 18.8%; Score 417.5; DB 2; Length 519;

Best Local Similarity 28.1%; Pred. No. 1e-27;

Matches 113; Conservative 62; Mismatches 154; Indels 73; Gaps 10; 

Qy 3 P SAT P GAQMGVP P GS RE P S PVPPDYED EFLRYLWRDYLY 41 

II I I I I I I I I I : I I I 

Db 45 PCRTLARSSPYPPVS FNHSQTLSTDQPAVGDVEDAAEDAAASMETGSFAFVVPWWRQVL- 103 

Qy 42 PKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLP 101

I : : : : : I I I I : I I I I I I I I I I I I I I I : I I : I : : : : 

Db 104 WS I L — FGGMVI VATGGNLI VAAVIVMTTKRMRTVTNYFIVNLS IADAMVS SLNVT 156 



Qy 102 ASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRA 161 



Ill II* • • • I • • I II II II . I 

Db 157 FNYYYMLDSDWPFGEFYCKLSQFIAMLSICASVFTLMAISIDRYVAIIRPLQPRMSKRCN 216 

Qy 162 RGSILGIWAVSLAIMVPQAAVMECSSV-LPELANRTRLFSVCDERWAD DLYPKIY 215 

I I I I I : | : | : | | | | | | | : | 

Db 217 LAIAAVIWLASTLI SCPMMI I YRTEEVPVRGLSNRT VCYPEWPDGPTNHSTMESLY 272 

Qy 216 HSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGE 275

: | : | | | : | : | :: : | M : M 
Db 273 NILIIILTYFLPIVSMTVTYSRVGIELWGSK TIGE 307 

Qy 276 PQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREA 335

II : | : : | : : | : | | : : | | : | : || : | : | | : : : : | s 

Db 308 CTPR QVENVRSKRRWKMMIVWLIFAI CWLPFHSYFIITSCYPAITEAPFIQE 361 

Qy 336 VYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCCL 377 

: I : I I : I I I I I I I ::: : I I II I II 

Db 362 LYLAI YWLAMSNSMYNP 1 1 YCWMNS RFRYGFKMVFRWCL 400 



RESULT 10 
JQ1274 

neurokinin 1 receptor - human 

N;Alternate names: NK-1 receptor; substance P receptor 
C;Species: Homo sapiens (man)

C;Date: 20-Apr-2000 #sequence_revision 20-Apr-2000 #text_change 20-Apr-2000
C;Accession: A41134; JQ1274; JH0478; S21188

R;Gerard, N.P.; Garraway, L.A.; Eddy Jr., R.L.; Shows, T.B.; Iijima, H.; Paquet,
J.L.; Gerard, C.
Biochemistry 30, 10640-10646, 1991

A;Title: Human substance P receptor (NK-1): organization of the gene, chromosome
localization, and functional expression of cDNA clones.
A;Reference number: A41134; MUID:92031510; PMID:1657150
A;Accession: A41134
A;Molecule type: DNA
A;Residues: 1-328,'G',329-332,334-407 <GER>
A;Cross-references: GB:M76675; NID:g189231

A;Note: in the authors' translation 333-Gly is shown before residue 329 and,
consequently, residues 329-332 are displaced one codon to the right
R;Takeda, Y.; Chou, K.B.; Takeda, J.; Sachais, B.S.; Krause, J.E.
Biochem. Biophys. Res. Commun. 179, 1232-1240, 1991

A;Title: Molecular cloning, structural characterization and functional
expression of the human substance P receptor.

A;Reference number: JQ1274; MUID:92028856; PMID:1718267
A;Accession: JQ1274
A;Molecule type: mRNA
A;Residues: 1-407 <TAK1>

A;Cross-references: GB:M74290; NID:g338612; PIDN:AAA60601.1; PID:g338613
R;Hopkins, B.; Powell, S.J.; Danks, P.; Briggs, I.; Graham, A.
Biochem. Biophys. Res. Commun. 180, 1110-1117, 1991

A;Title: Isolation and characterization of the human lung NK-1 receptor cDNA.
A;Reference number: JH0478; MUID:92062052; PMID:1659396
A;Accession: JH0478
A;Molecule type: mRNA
A;Residues: 1-407 <HOP>

A;Cross-references: GB:S62045; NID:g237994; PIDN:AAB20168.1; PID:g237995
A;Experimental source: lung
A;Note: the authors translated the codon CAA for residue 31 as Glu
R;Takahashi, K.; Tanaka, A.; Hara, M.; Nakanishi, S.
Eur. J. Biochem. 204, 1025-1033, 1992

A;Title: The primary structure and gene organization of human substance P and
neuromedin K receptors.

A;Reference number: S21188; MUID:92201186; PMID:1312928
A;Accession: S21188
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-407 <TAK2>

A;Cross-references: GB:X65177; NID:g36636; PIDN:CAA46292.1; PID:g825721

C;Comment: The endogenous ligand of this receptor is neurokinin 1 (substance P),
one of the peptides in the mammalian tachykinin system.

C;Genetics:
A;Gene: GDB:TAC1R
A;Cross-references: GDB:128977; OMIM:162323
A;Map position: 2pter-2qter

C;Superfamily: neurokinin 1 receptor

C;Keywords: G protein-coupled receptor; glycoprotein; hormone receptor;
transmembrane protein

F;32-58/Domain: transmembrane #status predicted <TM1>
F;69-92/Domain: transmembrane #status predicted <TM2>
F;110-128/Domain: transmembrane #status predicted <TM3>
F;149-168/Domain: transmembrane #status predicted <TM4>
F;195-221/Domain: transmembrane #status predicted <TM5>
F;249-273/Domain: transmembrane #status predicted <TM6>
F;286-308/Domain: transmembrane #status predicted <TM7>
F;14,18/Binding site: carbohydrate (Asn) (covalent) #status predicted
F;105-180/Disulfide bonds: #status predicted

Query Match 18.8%; Score 417; DB 1; Length 407; 

Best Local Similarity 28.0%; Pred. No. 8.5e-28; 

Matches 110; Conservative 69; Mismatches 152; Indels 62; Gaps 10; 

Qy 42 PKQY EWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLV 95

II: I :: Ml : I : : I I I : I : : I I I I I I I I : I I I : I : : 

Db 22 PNQFVQPAWQIVLWAAAYTVI VVTSWGN^ 81 

Qy 96 TAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFK 155 

I : : I : I II : : I :::::: | | | : | | | | | : 

Db 82 AAFNTVVNFTYAVHNEWYYGLFYCKFHNFFPIAAVFASIYSMTAVAFDRYMAIIHPLQPR 141 

Qy 156 STARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDERWAD DLYP 212

: I : I I I : : I : I I I : : I | | | : : | 

Db 142 LSATATKWICVIWVLALLLAFPQGYY STTETMPSRV VCMIEWPEHPNKI YE 193 

Qy 213 KIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGL 272 

I : I I I : : I II:: II : II : I I I : I 
Db 194 KVYH I CVT VL I Y FL P L L VI G YA YT WG I T LWAS EIPGDSSD R YH 237 

Qy 273 SGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASD 332

: I : I : II I I : : I I : I I : I : I I : : I : 
Db 238 EQVSAKRKWKMMIWVCTFAI CWLPFHIFFLLPYINPDLYLKKF 282

Qy 333 REAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCC-LPGLGPCGSLKAPSP 391 

• W II : : : I I I I I I : : I I I I I I I I I I : I 

Db 283 IQQVYLAIM WLAMSSTMYNPIIYCCLNDRFRLGFKHAFRCCPFISAGDYEGLEMKST 339 



Qy 392 RSSASHKSLSLQSRCSVSKISEHVVLTSVTTVL 424

I II:: II |:| : |:::||: 

Db 340 R YLQTQGSVYKVSR — LETTISTW 362 



RESULT 11 
S20304 

neurokinin 1 receptor - mouse 

N;Alternate names: NK-1 receptor; substance P receptor 
C;Species: Mus musculus (house mouse)

C;Date: 22-Nov-1993 #sequence_revision 10-Nov-1995 #text_change 02-Jun-2000
C;Accession: S20304; I56216; I73044

R;Sundelin, J.B.; Provvedini, D.M.; Wahlestedt, C.R.; Laurell, H.; Pohl, J.S.;
Peterson, P.A.
Eur. J. Biochem. 203, 625-631, 1992

A;Title: Molecular cloning of the murine substance K and substance P receptor
genes.

A;Reference number: S20303; MUID:92137253; PMID:1370937
A;Accession: S20304
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-407 <SUN>

A;Cross-references: GB:X62934; NID:g54206; PIDN:CAA44707.1; PID:g54207
R;Cook, G.A.; Elliott, D.; Metwali, A.; Blum, A.M.; Sandor, M.; Lynch, R.;
Weinstock, J.V.
J. Immunol. 152, 1830-1835, 1994

A;Title: Molecular evidence that granuloma T lymphocytes in murine
schistosomiasis mansoni express an authentic substance P (NK-1) receptor.
A;Reference number: I56216; MUID:94165478; PMID:8120392
A;Accession: I56216
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 63-290 <C001>

A;Cross-references: GB:L27826; NID:g450286; PIDN:AAA17891.1; PID:g480776
A;Experimental source: tissue brain
A;Accession: I73044
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 63-290 <C002>

A;Cross-references: GB:L27828; NID:g450290; PIDN:AAA17892.1; PID:g480778
A;Experimental source: tissue granuloma
C;Superfamily: neurokinin 1 receptor

C;Keywords: G protein-coupled receptor; transmembrane protein

Query Match 18.8%; Score 417; DB 2; Length 407;

Best Local Similarity 27.3%; Pred. No. 8.5e-28;

Matches 111; Conservative 70; Mismatches 149; Indels 76; Gaps 8;

Qy 29 DEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIVNL 88

: : I • : I : I s III : I : : I I I : I : : I I I I I I I I : I I I 

Db 23 NQFVQPTWQIVLW AAAYTVI WTSWGNVWIWIILAHKRMRTWNYFLVNL 74 

Qy 89 SLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAI 148

: I : : I : : I : I II : :::::::: | | | : I | 

Db 75 AFAEACMAAFNTVVNFTYAVHNVWYYGLFYCKFHNFFPIAALFASIYSMTAVAFDRYMAI 134 



Qy 149 CHPLLFKSTARRARGSILGIWAVSLAIMVPQA AVMECSSVLPELANRTRLF 199
Ml : • : 1 : 1 1 1 : : I : I I : : | | | | | |
Db 135 IHPLQPRLSATATKWI FVIWVLALLLAFPQGYYSTTETMPSRVVCMIEWPEHPNRT 191

Qy 200 SVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWK 259
1 1 1 1 1 : : 1 II:: II : II : 1 1 1 : 1
Db 192 YEKAYHICVTVLIYFLPLLVIGYAYTWGITLWASEIPGDSSDRYH 237

Qy 260 RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNV 319
: 1 : 1 : 1 1 1 1 : : 1 I : II : 1 : II : :
1 1 IP F 1 II" 1 1 " f * 1 1 * •
Db 238

Qy 320 LKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCC-LP 378
1 : : 1 1 III : : : 1 1 I I I 1 : : I I II 1 1 1 1
Db 270 LPYINPDLYLKKFIQQVYLA SMWLAMS STMYNPI I YCCLNDRFRLGFKHAFRCCPFI 326

Qy 379 GLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSVTTVL 424
1 1 : M M : : | | | : | : | : : : | | :
Db 327 SAGDYEGLEMKSTR YLQTQSSVYKVSR--LETTISTW 362



RESULT 12 
I51898

cholecystokinin A receptor - guinea pig
C;Species: Cavia porcellus (guinea pig)

C;Date: 04-Sep-1997 #sequence_revision 04-Sep-1997 #text_change 20-Apr-2000
C;Accession: I51898

R;De Weerth, A.; Pisegna, J.R.; Wank, S.A.
Am. J. Physiol. 265, G1116-G1121, 1993

A;Title: Guinea pig gallbladder and pancreas possess identical CCK-A receptor
subtypes: receptor cloning and expression.

A;Reference number: I51898; MUID:94106629; PMID:7916580
A;Accession: I51898
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-430 <RES>

A;Cross-references: GB:S68242; NID:g544723; PIDN:AAB29504.1; PID:g544724
C;Superfamily: neurokinin 1 receptor

Query Match 18.8%; Score 417; DB 2; Length 430; 

Best Local Similarity 28.4%; Pred. No. 9e-28; 

Matches 122; Conservative 81; Mismatches 169; Indels 58; Gaps 13; 



Qy 16 GSREPSPVPPDYEDEFLRYLWRDYLYPK-Q YEW VLIAAYVAVFVVALVGNTLVCLAV 71

II I : I : I I II I : II II I : | : : : : : || I I I : 

Db 11 GSNITSACELGFENETLFCLDR- PRPSKEWQPAVQILLYSLIFLLSVLGNTLVITVL 66 

Qy 72 WRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVS 131

II I I I I I I I : : : I : : : I : : : I : I : I : : : : : | | | : | | | | | | 

Db 67 IRNKRMRTVTNIFLLSLAVSDLMLCLFCMPFNLIPSLLKDFIFGSAVCKTTTYFMGTSVS 126 

Qy 132 VAVLTLSFIALDRWYAICHPLLFK--STARRARGSILGIWAVSLAIMVPQAAVMECSSVL 189

I : I I : I : I : I I I I I : I I | | : | | | | | : : : 

Db 127 VSTFNLVAISLERYGAICKPLQSRVWQTKSHALKVIAATWCLSFTIMTPYPIY SNLV 183 

Qy 190 PELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLW-GRQIP 248

I I : : : I : I : : : I : : : : I I : I : I I I : I : I : 



Db 184 PFTKNNNQTGNMCRFLLPNDVMQQTWHTFLLLILFLIPGIVMMVAYGLISLELYQGIKFD 243

Qy 249 GTT S ALVRNWKRP S DQLGDLEQG LSGEPQPRARAFLAEVKQ 289
: I : I : I I II I :: I
Db 244 AIQKKSAKERKTSTGSSGPMEDSDGCYLQKSRHPRK LELRQLSPSSSGSNRINRIR 299

Qy 290 MRARRKTAKMLMVVLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT F 342
: I : : : : I I : I : : : : I I I : : I I | : : : : : | - |
Db 300 SSSSTANLMAKKRVIRMLIVIWLFFLCWMPIFSANAWRAYDTV SAERHLSGTPISF 356

Qy 343 SHWLVYANSAANPIIYNFLSGKFREQFKAAFSCC LPGL-GPCG SLKAPSP 391
I I : I I I I I I I : : : | | I I I I I I I : , | : |
Db 357 ILLLSYTSSCVNPIIYCFMNKRFRLGFMATFPCCPNPGTPGVRGEMGEEEEGRTTGASLS 416

Qy 392 RSSASHKSLS 401

Db 417 RYSYSHMSTS 426



RESULT 13 
JQ1517 

neurokinin 3 receptor - human 

N;Alternate names: neuromedin K receptor; NK-3 receptor 
C;Species: Homo sapiens (man)

C;Date: 20-Apr-2000 #sequence_revision 20-Apr-2000 #text_change 19-May-2000
C;Accession: JQ1517; S20435; S21237

R;Huang, R.R.C.; Cheung, A.H.; Mazina, K.E.; Strader, C.D.; Fong, T.M.
Biochem. Biophys. Res. Commun. 184, 966-972, 1992

A;Title: cDNA sequence and heterologous expression of the human neurokinin-3
receptor.

A;Reference number: JQ1517; MUID:92246993; PMID:1374246
A;Accession: JQ1517
A;Status: nucleic acid sequence not shown
A;Molecule type: mRNA
A;Residues: 1-465 <HUA>

A;Cross-references: GB:M89473; NID:g189223; PIDN:AAA36366.1; PID:g189224
A;Experimental source: brain

R;Buell, G.; Schulz, M.F.; Arkinstall, S.J.; Maury, K.; Missotten, M.; Adami,
N.; Talabot, F.; Kawashima, E.
FEBS Lett. 299, 90-95, 1992

A;Title: Molecular characterisation, expression and localisation of human
neurokinin-3 receptor.

A;Reference number: S20435; MUID:92183914; PMID:1312036
A;Accession: S20435
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-2,'I',4-62,'R',64-465 <BUE>

A;Cross-references: GB:S86392; NID:g246908; PIDN:AAB21706.1; PID:g246909
R;Takahashi, K.; Tanaka, A.; Hara, M.; Nakanishi, S.
Eur. J. Biochem. 204, 1025-1033, 1992

A;Title: The primary structure and gene organization of human substance P and
neuromedin K receptors.

A;Reference number: S21188; MUID:92201186; PMID:1312928
A;Accession: S21237
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-438,'F',440-465 <TAK>

A;Cross-references: GB:X65172; NID:g35022; PIDN:CAA46291.1; PID:g825695

C;Comment: The endogenous ligand of this receptor is neurokinin 3 (neuromedin
K), one of the peptides in the mammalian tachykinin system.

C;Genetics:
A;Gene: GDB:TACR3
A;Cross-references: GDB:9599126

C;Superfamily: neurokinin 1 receptor

C;Keywords: G protein-coupled receptor; glycoprotein; hormone receptor;
transmembrane protein

F;83-111/Domain: transmembrane #status predicted <TM1>
F;122-147/Domain: transmembrane #status predicted <TM2>
F;160-181/Domain: transmembrane #status predicted <TM3>
F;202-221/Domain: transmembrane #status predicted <TM4>
F;247-272/Domain: transmembrane #status predicted <TM5>
F;300-321/Domain: transmembrane #status predicted <TM6>
F;333-355/Domain: transmembrane #status predicted <TM7>

F;23,50,73/Binding site: carbohydrate (Asn) (covalent) #status predicted
F;158-233/Disulfide bonds: #status predicted

Query Match 18.7%; Score 416; DB 1; Length 465; 

Best Local Similarity 28.7%; Pred. No. 1.2e-27; 

Matches 108; Conservative 70; Mismatches 142; Indels 56; Gaps 10; 

Qy 4 SATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVG 63 

I : : I I : I : I I II : : : I : : | | | | | | | | | | : : | 

Db 52 SSSPSA-LGLPVASPAPSQPWANLTNQFVQPSWRIAL WS L — AYGVWAVAVLG 102 

Qy 64 NTLVCIAWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLVDI 123 

I : I : : I I I I I I I I : I I I : : I : I : : : | | | | : 

Db 103 NLIVIWIIIAHKRMRTVTNYFLVNLAFSDASMAAFNTLVNFIYALHSEWYFGANYCRFQN 162 

Qy 124 YLQAVS VS VAVLTL S FI ALDRWYAI CH P LL FKS TARRARGS I L GI WAVS LAIMVPQAAVM 183 

: : I : : : : : | | : | | : | | | | : : | : | | | : : s | | 
Db 163 FFPITAVFASIYSMTAIAVDRYMAIIDPLKPRLSATATKIVIGSIWILAFLLAFPQCLYS 222 

Qy 184 ECSSVLPELANRTRLFSVCDERWADDLYPK IYHSCFFI VTYLAPLGLMAMAYFQI FR 240 

' ' I s I II : I : I : II II I : I I I : I : I : 

Db 223 K-TKVMP GRT LCFVQWPEG--PKQHFTYHIIVIILVYCFPLLIMGITYTIVGI 272 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300

Ml :IM I :|::|:|| II: 

Db 273 TLWGGEI PGDTCDKYH EQLKAKRKWKMM 301 

Qy 301 MWLLVFALCYLP I SVLNVLKRVFGMFRQAS DREAVYACFT FSHWLVYANSAANP 1 1 YNF 360 

: : I : : I I : I : I I : : I : : : : | | | | | : : : | | | | | 

Db 302 1 1 WMT FAI CWLPYHI YFI LTAI YQQLNRWKYI QQVYLA SFWLAMSSTMYNPIIYCC 358 

Qy 361 LSGKFREQFKAAFSCC 376 

I : : I I I I I I I 
Db 359 LNKRFRAGFKRAFRWC 374 



RESULT 14 
A34916 

neurokinin 3 receptor - rat 

N;Alternate names: neuromedin K receptor; NK-3 receptor
C;Species: Rattus norvegicus (Norway rat)

C;Date: 22-Jan-1993 #sequence_revision 22-Jan-1993 #text_change 19-May-2000
C;Accession: A34916

R;Shigemoto, R.; Yokota, Y.; Tsuchida, K.; Nakanishi, S.
J. Biol. Chem. 265, 623-628, 1990

A;Title: Cloning and expression of a rat neuromedin K receptor cDNA.
A;Reference number: A34916; MUID:90110113; PMID:2153106
A;Accession: A34916
A;Molecule type: mRNA
A;Residues: 1-452 <SHI>

A;Cross-references: GB:J05189; NID:g205670; PIDN:AAA41688.1; PID:g205671
C;Superfamily: neurokinin 1 receptor

C;Keywords: G protein-coupled receptor; glycoprotein; membrane protein 

Query Match 18.7%; Score 414.5; DB 2; Length 452; 

Best Local Similarity 28.7%; Pred. No. 1.6e-27; 

Matches 106; Conservative 66; Mismatches 142; Indels 55; Gaps 8;

Qy 11 MG VP P G S RE P S P VP PDYEDEFL R YL W RD YL Y P KQ Y EWVL I AAYVAVFWAL VGN T L VC LA 7 0 

:|:| :: II I : ::|:: II I II || | ||: || :| 

Db 45 LGLPATTQAPSQVRANLTNQFVQPSWRIAL WSL— AYGLWAVAVFGNLIVIWI 9 6 

Qy 71 WRNHHMRTVTNYFIWLSIJUDVLWAICLPASLLVDITESWLFGHALCKVIPYLQAVSV 130 

: : I I I I I I I I : I I I :: I II : : : | | | | : : : | 

Db 97 I LAHKRMRT VTNYFLVNLAFS DAS VAAFNTLIN FI YGLH S EW YFGAN YCRFQNFFP I TAV 156 

Qy 131 SVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLP 190

:: ::: 11:11: II II : :| : | || :: : || | : 

Db 157 FAS I YSMTAI AVDRYMAI I DPLKPRLSATATKI VI GS I WI LAFLLAFPQCLY SKIK 212 

Qy 191 ELANRTRLFSVCDERWADDLYPK 1 YHSCFFIVTYLAPLGLMAMAYFQI FRKLWGRQI 2 47 

: M s I : I : II II I : I I I : I : I : Ml : I 
Db 213 VMPGRT -LCYVQWPEG — PKQHFTYHIIVIILVYCFPLLIMGVTYTIVGITLWGGEI 266 

Qy 24 8 P GTT S ALVRNWKRP S DQLGDLEQGLS GE PQPRARAFLAEVKQMRARRKTAKMLMWLLVF 307 

M I : I : : I : I I I I : : : I : : I 

Db 267 PGDTCDKYH EQLKAKRKWKMMI I VWT F 295 

Qy 308 ALC YLP I S VLNVLKRVFGMFRQAS DREAVYAC FT FSHWLVYANS AAN P 1 1 YNFLS GKFRE 367 

I : I : I I I : I : : = =11 III : : : I I I I I I : : I I 

Db 296 AI CWLP YHVYFI LTAI YQQLNRWKYIQQVYLA S FWLAMS STMYNP 1 1 YCCLNKRFRA 352 

Qy 368 QFKAAFSCC 376 

I I I I I 
Db 353 GFKRAFRWC 361 



RESULT 15 
156595 

neurokinin 2 receptor - guinea pig 

C;Species: Cavia porcellus (guinea pig)

C;Date: 04-Sep-1997 #sequence_revision 04-Sep-1997 #text_change 20-Apr-2000
C;Accession: 156595

R;Aharony, D.; Little, J.; Thomas, C.; Powell, S.; Downey-Jones, M.; Graham, A.
J. Recept. Res. 14, 399-421, 1994

A;Title: Isolation and characterization of neurokinin A receptor cDNAs from
guinea-pig lung and rabbit pulmonary artery.
A;Reference number: 156595; MUID:95182423; PMID:7877137
A;Accession: 156595
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-402 <RES>

A;Cross-references: GB:S76253; NID:g913274; PIDN:AAB33553.1; PID:g913275
C;Superfamily: neurokinin 1 receptor

Query Match 18.6%; Score 413.5; DB 2; Length 402; 

Best Local Similarity 29.7%; Pred. No. 1.7e-27; 

Matches 113; Conservative 64; Mismatches 134; Indels 69; Gaps 12; 

Qy 47 WVL 1 AAYVAVFVVALVGNT L VC LAVWRNHHMRT VTN YFI VNL S LADVL VT AI C L PAS 103 

II I I : I : : I I : I I I : : M I I I I I I I I I I : | | | : : | s 

Db 31 WQLAL WAT AYLALVLVAW GNAT VT^ 9 0 

Qy 104 LLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARG 163

: I I I I I : : | : : : : : | | : | | : I I I I : : I : 

Db 91 FVYASHNIWYFGRAFCYFQNLFPITAMFVSIYSMTAIAIDRYMAIVHPFQPRLSAPSTKA 150 

Qy 164 SILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDERWADDLYPK IYHSCFF 220

I I I I I : I I : I I |:| | : | | : | | . | | 

Db 15 1 VI GGIWLVALALAFPQCFY STITEDEGATK CWAWPEDSRDKSLLLYHLWI 202 

Qy 221 IVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRA 280

: : I I I I : I : I I I III: | | 

Db 203 VLIYLLPLTVMFVAYSIIGLTLWRRAV PRH 2 32 

Qy 281 RAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACF 340
: I I : : : : I : : I I : : : | : : | | : | : | | : : | | : | : | | 

Db 233 QAHGANLRHLQAKKKFVKTMVLVWTFAI CWLPYHLYFIL GSFQEDIY-CH 282

Qy 341 TFSH WLVYANSAANPIIYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRS 393

I M : : : I I I I I I : : I I I : I I I I I : | : 
Db 283 KFI QQVYLALFWLAMS S TMYNP 1 1 YCCLNRRFRS GFRLAFRCC- PWVT P TEEDKL 336 

Qy 394 SASH-KSLSLQ-SRCSVSKI 411 

: I I I I : : I I : I 
Db 337 ELTHTPSFSLRVNRCHTKEI 356 



Search completed: September 28, 2004, 10:00:55 
Job time : 41 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: September 28, 2004, 10:00:18 ; Search time 52 Seconds 

(without alignments) 
2628.124 Million cell updates/sec
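
The throughput figure can be checked with simple arithmetic. The sketch below assumes (this is an inference from the numbers in this report, not a documented definition) that one "cell update" is one query-residue by database-residue pair, so the rate is query length times database residues divided by search time. The function name is hypothetical.

```python
# Illustrative arithmetic only; the formula is an assumption inferred from
# the figures printed in this report.
def cell_updates_per_sec(query_len: int, db_residues: int, seconds: float) -> float:
    """Millions of dynamic-programming cells filled per second."""
    cells = query_len * db_residues  # one cell per (query residue, database residue) pair
    return cells / seconds / 1e6

# Query of 425 residues against 321558718 residues in 52 seconds.
rate = cell_updates_per_sec(425, 321_558_718, 52)
print(f"{rate:.3f} Million cell updates/sec")  # 2628.124
```

Under the same assumption, the earlier run in this file (425 residues against 282547505 residues in 126 seconds) gives 953.037, which agrees with the value printed there.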



Title:          US-10-070-532-2
Perfect score:  2219
Sequence:       1 MEPSATPGAQMGVPPGSREP...CSVSKISEHVVLTSVTTVLP 425

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       1349238 seqs, 321558718 residues

Total number of hits satisfying chosen parameters:      1349238

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
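
The run parameters name a Smith-Waterman ("sw") model with a gap-open penalty of 10.0 and a gap-extension penalty of 0.5. As a rough sketch of what such a local alignment computes, the code below implements the standard affine-gap recurrences (Gotoh's formulation). Two things are assumptions and not taken from this report: a gap of length k is charged gap_open + k * gap_ext, and `toy_score` is a hypothetical stand-in for the BLOSUM62 table, which is too large to reproduce here. The scores it produces are therefore not comparable to the scores in the listings.

```python
# Toy local-alignment scorer (Smith-Waterman with affine gaps).
# Assumptions: gap of length k costs gap_open + k * gap_ext;
# toy_score replaces BLOSUM62 for illustration only.

NEG_INF = float("-inf")

def toy_score(a: str, b: str) -> float:
    """Hypothetical substitution score: +4 for identity, -1 otherwise."""
    return 4.0 if a == b else -1.0

def smith_waterman_affine(query, target, score=toy_score,
                          gap_open=10.0, gap_ext=0.5):
    """Return the best local alignment score between query and target."""
    rows, cols = len(query) + 1, len(target) + 1
    h = [[0.0] * cols for _ in range(rows)]      # best score ending at (i, j)
    e = [[NEG_INF] * cols for _ in range(rows)]  # ending with a gap in the query
    f = [[NEG_INF] * cols for _ in range(rows)]  # ending with a gap in the target
    best = 0.0
    for i in range(1, rows):
        for j in range(1, cols):
            # Either open a new gap or extend an existing one.
            e[i][j] = max(h[i][j - 1] - gap_open - gap_ext, e[i][j - 1] - gap_ext)
            f[i][j] = max(h[i - 1][j] - gap_open - gap_ext, f[i - 1][j] - gap_ext)
            diag = h[i - 1][j - 1] + score(query[i - 1], target[j - 1])
            # The floor of 0 is what makes the alignment local.
            h[i][j] = max(0.0, diag, e[i][j], f[i][j])
            best = max(best, h[i][j])
    return best

# Eight identities (8 * 4 = 32) minus one single-residue gap (10.0 + 0.5).
print(smith_waterman_affine("ACDEFGHIK", "ACDEGHIK"))  # 21.5
```

The double loop fills one cell per pair of residues, which is the quantity a "cell updates per second" figure would count.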



Database :      Published_Applications__AA:*
                1:  /cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB.pep:*
                2:  /cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB.pep:*
                3:  /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB.pep:*
                4:  /cgn2_6/ptodata/l/pubpaa/US06_PUBCOMB.pep:*
                5:  /cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB.pep:*
                6:  /cgn2_6/ptodata/l/pubpaa/PCTUS_PUBCOMB.pep:*
                7:  /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB.pep:*
                8:  /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep:*
                9:  /cgn2_6/ptodata/l/pubpaa/US09A_PUBCOMB.pep:*
                10: /cgn2_6/ptodata/l/pubpaa/US09B_PUBCOMB.pep:*
                11: /cgn2_6/ptodata/l/pubpaa/US09C_PUBCOMB.pep:*
                12: /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB.pep:*
                13: /cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB.pep:*
                14: /cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB.pep:*
                15: /cgn2_6/ptodata/l/pubpaa/US10C_PUBCOMB.pep:*
                16: /cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB.pep:*
                17: /cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB.pep:*
                18: /cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB.pep:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
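
The report does not say how the score distribution is analysed. Purely as an illustration of the idea, the sketch below fits an extreme-value (Gumbel) distribution to a list of database scores by the method of moments and multiplies the tail probability by the number of sequences. The Gumbel choice, the fitting method, and the name `predicted_number` are all assumptions; the program's actual procedure may differ.

```python
import math
import statistics

EULER_GAMMA = 0.5772156649015329

def predicted_number(db_scores, score):
    """Expected count of chance hits with a score >= `score`.

    Assumes (hypothetically) that chance scores follow a Gumbel
    distribution whose parameters are estimated from db_scores.
    """
    mean = statistics.fmean(db_scores)
    sd = statistics.pstdev(db_scores)
    beta = sd * math.sqrt(6.0) / math.pi   # Gumbel scale, method of moments
    mu = mean - EULER_GAMMA * beta         # Gumbel location
    # P(S >= score) = 1 - exp(-exp(-(score - mu) / beta)), written with
    # expm1 so that very small tail probabilities keep their precision.
    p_tail = -math.expm1(-math.exp(-(score - mu) / beta))
    return len(db_scores) * p_tail

# Made-up background scores, for demonstration only.
background = [30 + (i * 7) % 40 for i in range(1000)]
print(predicted_number(background, 60), predicted_number(background, 500))
```

A score far above the bulk of the distribution yields a value close to zero, which is the behaviour the very small Pred. No. values in the listings reflect.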
ALIGNMENTS



RESULT 1 
US-09-961-848-2 

; Sequence 2, Application US/09961848
; Patent No. US20020146719A1
; GENERAL INFORMATION:
; APPLICANT: Berglind Ran Olafsdottir
; APPLICANT: Jeffrey Gulcher
; TITLE OF INVENTION: HUMAN NARCOLEPSY GENE
; FILE REFERENCE: 2345.1005-004
; CURRENT APPLICATION NUMBER: US/09/961,848
; CURRENT FILING DATE: 2001-09-24
; PRIOR APPLICATION NUMBER: US 09/479,128
; PRIOR FILING DATE: 2000-01-07
; PRIOR APPLICATION NUMBER: US 09/379,083
; PRIOR FILING DATE: 1999-08-23
; NUMBER OF SEQ ID NOS: 22
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 2
; LENGTH: 425
; TYPE: PRT
; ORGANISM: Homo Sapiens
US-09-961-848-2

Query Match 100.0%; Score 2218; DB 9; Length 425;
Best Local Similarity 99.8%; Pred. No. 1.1e-197;
Matches 424; Conservative 1; Mismatches 0; Indels 0; Gaps 0;
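
The two percentages in each result header appear to follow from the other printed numbers. The sketch below assumes (as an inference from the values in this report, not from program documentation) that Query Match is the score divided by the perfect score of 2219, and that Best Local Similarity is the number of matches divided by the number of alignment columns (matches + conservative + mismatches + indels). The function names are hypothetical.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Score as a percentage of the query's self-alignment score."""
    return 100.0 * score / perfect_score

def best_local_similarity_pct(matches: int, conservative: int,
                              mismatches: int, indels: int) -> float:
    """Identical columns as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns

# Score 2218 against perfect score 2219; 424 matches and 1 conservative column.
print(round(query_match_pct(2218, 2219), 1))               # 100.0
print(round(best_local_similarity_pct(424, 1, 0, 0), 1))   # 99.8
```

The same relations reproduce the figures for the weaker hits, for example 108 matches over 376 columns giving 28.7%.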

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
       |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy 421 TTVLP 425
       |||||
Db 421 TTVLP 425
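
The Matches, Conservative, Mismatches and Indels counts in a result header can be cross-checked against the alignment text. The sketch below assumes the middle line marks identities with "|" and conservative substitutions with ":", leaves other substitutions blank, and that gaps appear as "-" in either sequence line; these conventions are read off the alignments in this report, and the function name `tally_columns` is hypothetical.

```python
def tally_columns(qy: str, mid: str, db: str) -> dict:
    """Count column types in one aligned block of equal-length strings."""
    counts = {"matches": 0, "conservative": 0, "mismatches": 0, "indels": 0}
    for q, m, d in zip(qy, mid, db):
        if q == "-" or d == "-":
            counts["indels"] += 1        # gap column in either sequence
        elif m == "|":
            counts["matches"] += 1       # identical residues
        elif m == ":":
            counts["conservative"] += 1  # similar residues
        else:
            counts["mismatches"] += 1    # dissimilar residues
    return counts

# Fragment around the single V/I difference in the last full block of Result 1.
print(tally_columns("SRCSVSKIS", "||||:||||", "SRCSISKIS"))
```

Summing the per-block tallies over a whole alignment should reproduce the header counts; the separate "Gaps" figure counts runs of gap columns and is not computed here.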



RESULT 2 

US-09-828-538-24 

; Sequence 24, Application US/09828538 



Patent No. US20010025031A1 
GENERAL INFORMATION: 
APPLICANT: Ellis, Catherine E.
APPLICANT: Kwok, Cheni 
APPLICANT: Bodsworth, Nicola J. 
APPLICANT: Halsey, Wendy 
APPLICANT: Van Horn, Stephanie 

TITLE OF INVENTION: HFGAN72 Receptor Genomic DNA and Methods 
TITLE OF INVENTION: of Use Thereof in Diagnostic Applications 
FILE REFERENCE: GH-50038-C1 
CURRENT APPLICATION NUMBER: US/09/828,538
CURRENT FILING DATE: 2001-04-06 
PRIOR APPLICATION NUMBER: 60/088,624 
PRIOR FILING DATE: 1998-06-08 
PRIOR APPLICATION NUMBER: 60/093,726 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: 09/328,014 
PRIOR FILING DATE: 1999-06-08 
NUMBER OF SEQ ID NOS : 24 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 24 
LENGTH: 425
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-828-538-24 



Query Match 99.8%; Score 2215; DB 9; Length 425;
Best Local Similarity 99.8%; Pred. No. 2e-197;
Matches 424; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420

Qy 421 TTVLP 425
       |||||
Db 421 TTVLP 425



RESULT 3 

US-09-828-538-19 

Sequence 19, Application US/09828538 
Patent No. US20010025031A1 
GENERAL INFORMATION: 
APPLICANT: Ellis, Catherine E. 
APPLICANT: Kwok, Cheni 
APPLICANT: Bodsworth, Nicola J. 
APPLICANT: Halsey, Wendy 
APPLICANT: Van Horn, Stephanie 

TITLE OF INVENTION: HFGAN72 Receptor Genomic DNA and Methods 
TITLE OF INVENTION: of Use Thereof in Diagnostic Applications 
FILE REFERENCE: GH-50038-C1 
CURRENT APPLICATION NUMBER: US/09/828,538 
CURRENT FILING DATE: 2001-04-06 
PRIOR APPLICATION NUMBER: 60/088,624 
PRIOR FILING DATE: 1998-06-08 
PRIOR APPLICATION NUMBER: 60/093,726 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: 09/328,014 
PRIOR FILING DATE: 1999-06-08 
NUMBER OF SEQ ID NOS : 24 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 19 
LENGTH: 425
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-828-538-19 

Query Match 99.8%; Score 2214; DB 9; Length 425;
Best Local Similarity 99.5%; Pred. No. 2.5e-197;
Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
       |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy 421 TTVLP 425
       |||||
Db 421 TTVLP 425



RESULT 4 

US-09-211-823C-22 

Sequence 22, Application US/09211823C 
Publication No. US20030087801A1 
GENERAL INFORMATION: 
APPLICANT: HAGEN, JAMES JOSEPH 
APPLICANT: TERRETT, JONATHAN ALEXANDER 
APPLICANT: UPTON, NEIL 
APPLICANT: PIPER, DAVID 
APPLICANT: SMITH, MARTIN IAN 
APPLICANT: KENNETT, GUY ANTHONY 
APPLICANT: PATEL, SARASWATI R.
TITLE OF INVENTION: METHODS OF TREATMENT USING NOVEL LIGANDS
TITLE OF INVENTION: OF THE NEUROPEPTIDE RECEPTOR HFGAN72 AND AGONISTS OR
TITLE OF INVENTION: ANTAGONISTS THEREOF
FILE REFERENCE: P50745
CURRENT APPLICATION NUMBER: US/09/211,823C
CURRENT FILING DATE: 1998-12-15 
PRIOR APPLICATION NUMBER: US 60/069,459 
PRIOR FILING DATE: 1997-12-15 
PRIOR APPLICATION NUMBER: US 60/069,785 
PRIOR FILING DATE: 1997-12-16 
NUMBER OF SEQ ID NOS : 23 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 22 
LENGTH: 425 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-211-823C-22 



Query Match 99.8%; Score 2214; DB 10; Length 425;
Best Local Similarity 99.5%; Pred. No. 2.5e-197;
Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
       |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy 421 TTVLP 425
       |||||
Db 421 TTVLP 425



RESULT 5 

US-10-225-567A-368 

; Sequence 368, Application US/10225567A
; Publication No. US20030113798A1
; GENERAL INFORMATION:
; APPLICANT: Lifespan Biosciences
; APPLICANT: Brown, Joseph P.
; APPLICANT: Burmer, Glenna C.
; APPLICANT: Roush, Christine L.
; TITLE OF INVENTION: ANTIGENIC PEPTIDES AND ANTIBODIES FOR G PROTEIN-COUPLED RECEPTORS (GPCRS)
; FILE REFERENCE: 1920-4-4
; CURRENT APPLICATION NUMBER: US/10/225,567A
; CURRENT FILING DATE: 2001-12-19
; PRIOR APPLICATION NUMBER: 60/257,144
; PRIOR FILING DATE: 2000-12-19
; NUMBER OF SEQ ID NOS: 2292
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 368
; LENGTH: 425
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-225-567A-368



Query Match 99.8%; Score 2214; DB 14; Length 425;
Best Local Similarity 99.5%; Pred. No. 2.5e-197;
Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
       |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy 421 TTVLP 425
       |||||
Db 421 TTVLP 425



RESULT 6 

US-10-352-684A-22 

Sequence 22, Application US/10352684A 
Publication No. US20030215452A1 
GENERAL INFORMATION: 
APPLICANT: Millennium Pharmaceuticals Inc. 
APPLICANT: Carroll, Joseph M. 
APPLICANT: Healy, Aileen 
APPLICANT: Weich, Nadine S. 
APPLICANT: Kelly, Louise M. 

TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR TREATING
TITLE OF INVENTION: HEMATOLOGICAL DISORDERS USING 131, 148, 199, 12303, 13906,
TITLE OF INVENTION: 15513, 17822, 302, 5677, 194, 14393, 28059, 7366, 12212,
TITLE OF INVENTION: 1981, 261, 12416, 270, 1410, 137, 1871, 13051, 1847, 1849,
TITLE OF INVENTION: 15402, 340, 10217, 837, 1761, 8990 OR 13249 MOLECULES
FILE REFERENCE: MPI02-019P1RNOMNIM
CURRENT APPLICATION NUMBER: US/10/352,684A
CURRENT FILING DATE: 2003-01-28 
PRIOR APPLICATION NUMBER: US 60/354,333 
PRIOR FILING DATE: 2002-02-04 
PRIOR APPLICATION NUMBER: US 60/360,258 
PRIOR FILING DATE: 2002-02-28 
PRIOR APPLICATION NUMBER: US 60/364,476 
PRIOR FILING DATE: 2002-03-15 
PRIOR APPLICATION NUMBER: US 60/375,626 
PRIOR FILING DATE: 2002-04-26 



PRIOR APPLICATION NUMBER: US 60/386,494 
PRIOR FILING DATE: 2002-06-06 
PRIOR APPLICATION NUMBER: US 60/390,965 
PRIOR FILING DATE: 2002-06-24 
PRIOR APPLICATION NUMBER: US 60/392,480 
PRIOR FILING DATE: 2002-06-28 
PRIOR APPLICATION NUMBER: US 60/394,128 
PRIOR FILING DATE: 2002-07-03 
PRIOR APPLICATION NUMBER: US 60/399,783 
PRIOR FILING DATE: 2002-07-31 
PRIOR APPLICATION NUMBER: US 60/403,221
PRIOR FILING DATE: 2002-08-13 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 62 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 22 
LENGTH: 425 
TYPE: PRT 

ORGANISM: Homo Sapiens 
US-10-352-684A-22 



Query Match 99.8%; Score 2214; DB 15; Length 425;
Best Local Similarity 99.5%; Pred. No. 2.5e-197;
Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
       |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy 421 TTVLP 425
       |||||
Db 421 TTVLP 425



RESULT 7 

US-09-826-509-549 

; Sequence 549, Application US/09826509 
; Publication No. US20030204073A1 
; GENERAL INFORMATION: 

; APPLICANT: Lehmann-Bruinsma, Karin
; APPLICANT: Liaw, Chen W.
; APPLICANT: Lin, I-Lin
; TITLE OF INVENTION: No. US20030204073A1-Endogenous, Constitutively Activated Known G
; TITLE OF INVENTION: Protein-Coupled Receptors
; FILE REFERENCE: AREN-207
; CURRENT APPLICATION NUMBER: US/09/826,509
; CURRENT FILING DATE: 2001-04-05
; PRIOR APPLICATION NUMBER: 60/195,747
; PRIOR FILING DATE: 2000-04-07
; PRIOR APPLICATION NUMBER: 09/170,496
; PRIOR FILING DATE: 1998-10-13
; NUMBER OF SEQ ID NOS: 589
; SOFTWARE: PatentIn Version 2.1
; SEQ ID NO 549 
; LENGTH: 425 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-826-509-549 

Query Match 99.5%; Score 2209; DB 11; Length 425;
Best Local Similarity 99.3%; Pred. No. 7.4e-197;
Matches 422; Conservative 1; Mismatches 2; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||| |||||||||||||||| |||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTKKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
       |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy 421 TTVLP 425
       |||||
Db 421 TTVLP 425



RESULT 8 
US-10-077-874-2 

; Sequence 2, Application US/10077874 
; Publication No. US20020115155A1 
GENERAL INFORMATION: 

APPLICANT: Soppet, Daniel et al 
; TITLE OF INVENTION: Human Neuropeptide Receptor 

NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 
CITY: Rockville 
; STATE: MD 

COUNTRY: USA 
ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:
APPLICATION NUMBER: US/10/077,874
FILING DATE: 20-Feb-2002
CLASSIFICATION: <Unknown>
PRIOR APPLICATION DATA:
; APPLICATION NUMBER: 08/462,509

FILING DATE: 05-JUNE-1995 
ATTORNEY/AGENT INFORMATION: 
NAME: Wales, Michele M. 
REGISTRATION NUMBER: 43,975 
REFERENCE/DOCKET NUMBER: PF168P1D1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 301-309-8504 
; TELEFAX: 301-309-8439 

; INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 402 amino acids
; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
US-10-077-874-2 

Query Match 95.0%; Score 2108; DB 13; Length 402;
Best Local Similarity 100.0%; Pred. No. 1.7e-187;
Matches 402; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300

Qy 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402
       ||||||||||||||||||||||||||||||||||||||||||
Db 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402



RESULT 9 
US-09-393-696-2 

; Sequence 2, Application US/09393696
; Publication No. US20030022277A1
; GENERAL INFORMATION:
;  APPLICANT: Human Genome Sciences, Inc. et al.
;  TITLE OF INVENTION: Human Neuropeptide Receptor
;  FILE REFERENCE: PF168P2
;  CURRENT APPLICATION NUMBER: US/09/393,696
;  CURRENT FILING DATE: 1999-09-10
;  EARLIER APPLICATION NUMBER: PCT/US95/05616
;  EARLIER FILING DATE: 1995-05-05
;  EARLIER APPLICATION NUMBER: US08/462,509
;  EARLIER FILING DATE: 1995-06-05
;  NUMBER OF SEQ ID NOS: 23
;  SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 2
;   LENGTH: 402
;   TYPE: PRT
;   ORGANISM: Homo sapiens
US-09-393-696-2 



Query Match 94.1%; Score 2087; DB 10; Length 402;
Best Local Similarity 99.0%; Pred. No. 1.6e-185;
Matches 398; Conservative 0; Mismatches 4; Indels 0; Gaps 0;

Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              |||||||||||||||||||||||||||||||||||||||||||||||||| |||||||||
Db          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIPAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCHERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
               |||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db        241 NLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402
              ||||||||||||||||||||||||||||||||||||||||||
Db        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSL 402


RESULT 10 
US-09-730-931-2 

; Sequence 2, Application US/09730931
; Patent No. US20020064814A1
; GENERAL INFORMATION:
;  APPLICANT: ELLIS, CATHERINE E.
;  TITLE OF INVENTION: DOG OREXIN 1 RECEPTOR
;  FILE REFERENCE: GH-70669
;  CURRENT APPLICATION NUMBER: US/09/730,931
;  CURRENT FILING DATE: 2000-12-06
;  PRIOR APPLICATION NUMBER: 60/169,373
;  PRIOR FILING DATE: 1999-12-07
;  NUMBER OF SEQ ID NOS: 2
;  SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 2
;   LENGTH: 427
;   TYPE: PRT
;   ORGANISM: CANIS FAMILIARIS
US-09-730-931-2 

Query Match 94.1%; Score 2087; DB 9; Length 427; 

Best Local Similarity 94.1%; Pred. No. 1.7e-185; 

Matches 402; Conservative 5; Mismatches 18; Indels 2; Gaps 1; 

QY 1 MEPSATPGAQMGVPPGSRE— PSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFV 58 

I I I I I I M M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I Mill: 

Db 1 MEPSATPGAQTGTPTGGGELSPSLVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFL 60 

Qy 59 VALVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLVTAICL 118 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I | | | | | | | | | M I I I I I I I I 
Db 61 VALVGNTLVCLAWRNHHMRTVTNYFIWLSLADVLVTAICLPASLLV^ 120 



Qy 



119 CKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVS LAIMVP 178 



Db 


121 


M M 1 1 1 M 1 1 1 1 1 II II 1 1 I I 1 I I I I I | | | | | | | | | | | | | | | | 111111:111 

C KVI P Y LQAVS VS VAVL T L S F I AL D RW YAI CHPLLFKS T ARRAR S S I L G I WAVS LAVMVP 


180 


Qy 


179 


QAAVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQI 
M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I M 1 1 1 I I I I 1 I I I | | || | | | | | | | | | | | | | | | | | | | | 
QAAVMECSSVLPELANRTRLFSVCDEHWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQI 


238 


Db 


181 


240 


Qy 


239 


FRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAK 

1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

FRKLWGRQIPGTTSALVRNWKRPSDQLEDQGPGLSAEPPPRARAFLAEVKQMRARRKTAK 


298 


Db 


241 


300 


Qy 


299 


MLMWL L VFAL C Y L P I S VLN VL K RV F GM F RQ AS D R EAVYAC FT F S H W L VYAN S AAN P 1 1 Y 

1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 
MLMVLLLVFALC YL P I S VLNVLKRVFGMFRQ S S DREAVYAC FT FS HWLVYAN S AAN P 1 1 Y 


358 


Db 


301 


360 


Qy 


359 


NFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLT 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 : 1 II 1 II 

NFLSGKFREQFKAAFSCCLPGLGPCGSPKAPSPRSSASHKSLSLHSRCSVSKVPEHWLT 


418 


Db 


361 


420 


Qy 


419 


SVTTVLP 425 
1 1 1 1 1 II 
SVTTVLP 427 




Db 


421 





RESULT 11 
US-09-828-538-20 

; Sequence 20, Application US/09828538
; Patent No. US20010025031A1
; GENERAL INFORMATION:
;  APPLICANT: Ellis, Catherine E.
;  APPLICANT: Kwok, Cheni
;  APPLICANT: Bodsworth, Nicola J.
;  APPLICANT: Halsey, Wendy
;  APPLICANT: Van Horn, Stephanie
;  TITLE OF INVENTION: HFGAN72 Receptor Genomic DNA and Methods
;  TITLE OF INVENTION: of Use Thereof in Diagnostic Applications
;  FILE REFERENCE: GH-50038-C1
;  CURRENT APPLICATION NUMBER: US/09/828,538
;  CURRENT FILING DATE: 2001-04-06
;  PRIOR APPLICATION NUMBER: 60/088,624
;  PRIOR FILING DATE: 1998-06-08
;  PRIOR APPLICATION NUMBER: 60/093,726
;  PRIOR FILING DATE: 1998-07-22
;  PRIOR APPLICATION NUMBER: 09/328,014
;  PRIOR FILING DATE: 1999-06-08
;  NUMBER OF SEQ ID NOS: 24
;  SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 20
;   LENGTH: 389
;   TYPE: PRT
;   ORGANISM: HOMO SAPIENS
US-09-828-538-20 



Query Match 85.7%; Score 1902.5; DB 9; Length 389;
Best Local Similarity 96.1%; Pred. No. 2.2e-168;
Matches 368; Conservative 2; Mismatches 10; Indels 3; Gaps 2;

Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
              ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSGKFREQFKAAFSC-CLPGLGP 382
              |||   ::   | ||   ||  |
Db        361 LSG--CKEKSLALSCPSCPGHDP 381



RESULT 12 
US-09-211-823C-23 

; Sequence 23, Application US/09211823C
; Publication No. US20030087801A1
; GENERAL INFORMATION:
;  APPLICANT: HAGEN, JAMES JOSEPH
;  APPLICANT: TERRETT, JONATHAN ALEXANDER
;  APPLICANT: UPTON, NEIL
;  APPLICANT: PIPER, DAVID
;  APPLICANT: SMITH, MARTIN IAN
;  APPLICANT: KENNETT, GUY ANTHONY
;  APPLICANT: PATEL, SARASWATI R.
;  TITLE OF INVENTION: METHODS OF TREATMENT USING NOVEL LIGANDS
;  TITLE OF INVENTION: OF THE NEUROPEPTIDE RECEPTOR HFGAN72 AND AGONISTS OR
;  TITLE OF INVENTION: ANTAGONISTS THEREOF
;  FILE REFERENCE: P50745
;  CURRENT APPLICATION NUMBER: US/09/211,823C
;  CURRENT FILING DATE: 1998-12-15
;  PRIOR APPLICATION NUMBER: US 60/069,459
;  PRIOR FILING DATE: 1997-12-15
;  PRIOR APPLICATION NUMBER: US 60/069,785
;  PRIOR FILING DATE: 1997-12-16
;  NUMBER OF SEQ ID NOS: 23
;  SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 23
;   LENGTH: 389
;   TYPE: PRT
;   ORGANISM: HOMO SAPIENS



US-09-211-823C-23 



Query Match 85.7%; Score 1902.5; DB 10; Length 389;
Best Local Similarity 96.1%; Pred. No. 2.2e-168;
Matches 368; Conservative 2; Mismatches 10; Indels 3; Gaps 2;

Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
              ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSGKFREQFKAAFSC-CLPGLGP 382
              |||   ::   | ||   ||  |
Db        361 LSG--CKEKSLALSCPSCPGHDP 381



RESULT 13 
US-10-077-874-4 

; Sequence 4, Application US/10077874
; Publication No. US20020115155A1
; GENERAL INFORMATION:
;  APPLICANT: Soppet, Daniel et al
;  TITLE OF INVENTION: Human Neuropeptide Receptor
;  NUMBER OF SEQUENCES: 12
;  CORRESPONDENCE ADDRESS:
;   ADDRESSEE: Human Genome Sciences, Inc.
;   STREET: 9410 Key West Avenue
;   CITY: Rockville
;   STATE: MD
;   COUNTRY: USA
;   ZIP: 20850
;  COMPUTER READABLE FORM:
;   MEDIUM TYPE: Floppy disk
;   COMPUTER: IBM PC compatible
;   OPERATING SYSTEM: PC-DOS/MS-DOS
;   SOFTWARE: PatentIn Release #1.0, Version #1.30
;  CURRENT APPLICATION DATA:
;   APPLICATION NUMBER: US/10/077,874
;   FILING DATE: 20-Feb-2002
;   CLASSIFICATION: <Unknown>
;  PRIOR APPLICATION DATA:
;   APPLICATION NUMBER: 08/462,509
;   FILING DATE: 05-JUNE-1995
;  ATTORNEY/AGENT INFORMATION:
;   NAME: Wales, Michele M.
;   REGISTRATION NUMBER: 43,975
;   REFERENCE/DOCKET NUMBER: PF168P1D1
;  TELECOMMUNICATION INFORMATION:
;   TELEPHONE: 301-309-8504
;   TELEFAX: 301-309-8439
; INFORMATION FOR SEQ ID NO: 4:
;  SEQUENCE CHARACTERISTICS:
;   LENGTH: 369 amino acids
;   TYPE: amino acid
;   TOPOLOGY: linear
;  MOLECULE TYPE: protein
;  SEQUENCE DESCRIPTION: SEQ ID NO: 4:
US-10-077-874-4 

Query Match 85.5%; Score 1897; DB 13; Length 369;
Best Local Similarity 99.7%; Pred. No. 6.8e-168;
Matches 362; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
              ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSG 363
              |||
Db        361 LSG 363





RESULT 14 
US-09-393-696-6 

; Sequence 6, Application US/09393696
; Publication No. US20030022277A1
; GENERAL INFORMATION:
;  APPLICANT: Human Genome Sciences, Inc. et al.
;  TITLE OF INVENTION: Human Neuropeptide Receptor
;  FILE REFERENCE: PF168P2
;  CURRENT APPLICATION NUMBER: US/09/393,696
;  CURRENT FILING DATE: 1999-09-10
;  EARLIER APPLICATION NUMBER: PCT/US95/05616
;  EARLIER FILING DATE: 1995-05-05
;  EARLIER APPLICATION NUMBER: US08/462,509
;  EARLIER FILING DATE: 1995-06-05
;  NUMBER OF SEQ ID NOS: 23
;  SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 6
;   LENGTH: 372
;   TYPE: PRT
;   ORGANISM: Homo sapiens
US-09-393-696-6 

Query Match 85.4%; Score 1894; DB 10; Length 372; 

Best Local Similarity 99.4%; Pred. No. 1.3e-167; 

Matches 361; Conservative 1; Mismatches 1; Indels 0; Gaps 0 

Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              ||||||||||||||||||:|||||||||||||||||||||||||||||||||||||||||
Db          1 MEPSATPGAQMGVPPGSRDPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
              ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSG 363
              |||
Db        361 LSG 363



RESULT 15
US-10-077-874-6
; Sequence 6, Application US/10077874
; Publication No. US20020115155A1
; GENERAL INFORMATION:

;  APPLICANT: Soppet, Daniel et al
;  TITLE OF INVENTION: Human Neuropeptide Receptor
;  NUMBER OF SEQUENCES: 12
;  CORRESPONDENCE ADDRESS:
;   ADDRESSEE: Human Genome Sciences, Inc.
;   STREET: 9410 Key West Avenue
;   CITY: Rockville
;   STATE: MD
;   COUNTRY: USA
;   ZIP: 20850
;  COMPUTER READABLE FORM:
;   MEDIUM TYPE: Floppy disk
;   COMPUTER: IBM PC compatible
;   OPERATING SYSTEM: PC-DOS/MS-DOS
;   SOFTWARE: PatentIn Release #1.0, Version #1.30
;  CURRENT APPLICATION DATA:
;   APPLICATION NUMBER: US/10/077,874
;   FILING DATE: 20-Feb-2002
;   CLASSIFICATION: <Unknown>
;  PRIOR APPLICATION DATA:
;   APPLICATION NUMBER: 08/462,509
;   FILING DATE: 05-JUNE-1995
;  ATTORNEY/AGENT INFORMATION:
;   NAME: Wales, Michele M.
;   REGISTRATION NUMBER: 43,975
;   REFERENCE/DOCKET NUMBER: PF168P1D1
;  TELECOMMUNICATION INFORMATION:
;   TELEPHONE: 301-309-8504
;   TELEFAX: 301-309-8439
; INFORMATION FOR SEQ ID NO: 6:
;  SEQUENCE CHARACTERISTICS:
;   LENGTH: 372 amino acids
;   TYPE: amino acid
;   TOPOLOGY: linear
;  MOLECULE TYPE: protein
;  SEQUENCE DESCRIPTION: SEQ ID NO: 6:
US-10-077-874-6 



Query Match 85.4%; Score 1894; DB 13; Length 372; 

Best Local Similarity 99.4%; Pred. No. 1.3e-167; 

Matches 361; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              ||||||||||||||||||:|||||||||||||||||||||||||||||||||||||||||
Db          1 MEPSATPGAQMGVPPGSRDPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
              ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSG 363
              |||
Db        361 LSG 363



Search completed: September 28, 2004, 10:18:29 
Job time : 53 secs 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model

Run on:         September 28, 2004, 09:37:36 ; Search time 120 Seconds
                (without alignments)
                1117.460 Million cell updates/sec

Title:          US-10-070-532-2
Perfect score:  2219
Sequence:       1 MEPSATPGAQMGVPPGSREP..........CSVSKISEHVVLTSVTTVLP 425

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       1017041 seqs, 315518202 residues

Total number of hits satisfying chosen parameters:      1017041

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
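The "sw model" with an opening and an extension gap penalty named in the header can be illustrated with a small local-alignment scorer. The following is only a sketch and not the GenCore implementation: the function names, the toy `toy_score` substitution function (used in place of the BLOSUM62 table) and the convention that a gap of length k costs Gapop + (k - 1) * Gapext are all assumptions made for illustration.

```python
def toy_score(x: str, y: str) -> int:
    # Hypothetical stand-in for a BLOSUM62 lookup: +5 for identity, -4 otherwise.
    return 5 if x == y else -4


def smith_waterman_affine(a, b, score, gap_open=10.0, gap_extend=0.5):
    """Return the best local alignment score using affine gap penalties.

    Assumed convention: the first residue of a gap costs gap_open and each
    further residue costs gap_extend.
    """
    neg_inf = float("-inf")
    m = len(b)
    h_prev = [0.0] * (m + 1)      # best scores ending in the previous row
    f = [neg_inf] * (m + 1)       # best scores ending in a vertical gap, per column
    best = 0.0
    for i in range(1, len(a) + 1):
        h_cur = [0.0] * (m + 1)
        e = neg_inf               # best score ending in a horizontal gap
        for j in range(1, m + 1):
            e = max(h_cur[j - 1] - gap_open, e - gap_extend)
            f[j] = max(h_prev[j] - gap_open, f[j] - gap_extend)
            diag = h_prev[j - 1] + score(a[i - 1], b[j - 1])
            h_cur[j] = max(0.0, diag, e, f[j])   # 0.0 restarts the local alignment
            best = max(best, h_cur[j])
        h_prev = h_cur
    return best


if __name__ == "__main__":
    print(smith_waterman_affine("MEPSAT", "MEPSAT", toy_score))
```

With a real substitution table in place of `toy_score`, aligning a sequence against itself would give the kind of self-match value the report lists as the perfect score.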



Database :      SPTREMBL 25:
                1:  sp_archea:*
                2:  sp_bacteria:*
                3:  sp_fungi:*
                4:  sp_human:*
                5:  sp_invertebrate:*
                6:  sp_mammal:*
                7:  sp_mhc:*
                8:  sp_organelle:*
                9:  sp_phage:*
                10: sp_plant:*
                11: sp_rodent:*
                12: sp_virus:*
                13: sp_vertebrate:*
                14: sp_unclassified:*
                15: sp_rvirus:*
                16: sp_bacteriap:*
                17: sp_archeap:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                    %
Result          Query
   No.   Score  Match  Length  DB  ID       Description
     1    2218  100.0     425   4  Q9HBV6   Q9hbv6 homo sapien
     2    1287   58.0     364  11  Q8BV78   Q8bv78 mus musculu
     3     976   44.0     199  11  Q80T45   Q80t45 mus musculu
     4   659.5   29.7     166   6  Q8MJ13   Q8mj13 ovis aries
     5     618   27.9     127   6  Q8SPR4   Q8spr4 ovis aries
     6     533   24.0     109   6  Q8I010   Q8i010 bos taurus
     7     512   23.1     417  11  Q8BKR6   Q8bkr6 mus musculu
     8     509   22.9     417  11  Q924H0   Q924h0 mus musculu
     9   508.5   22.9     405  11  Q924N0   Q924n0 mus musculu
    10     480   21.6     432  11  Q924G9   Q924g9 rattus norv
    11   473.5   21.3     758   5  Q7YU49   Q7yu49 drosophila
    12   452.5   20.4     449   5  Q9VB87   Q9vb87 drosophila
    13     446   20.1     375  13  O57463   O57463 brachydanio
    14     446   20.1     517   5  Q9VWR3   Q9vwr3 drosophila
    15     444   20.0     540   5  Q9VRM0   Q9vrm0 drosophila
    16   443.5   20.0     436  13  Q7T1P8   Q7t1p8 gallus gall
    17   438.5   19.8     393  13  Q7T078   Q7t078 fugu rubrip
    18   435.5   19.6     381  11  Q9ERC0   Q9erc0 rattus norv
    19   434.5   19.6     381  11  Q8BWV1   Q8bwv1 mus musculu
    20   433.5   19.5     678   5  Q94736   Q94736 stomoxys ca
    21     428   19.3     373  13  O73734   O73734 brachydanio
    22   418.5   18.9     397   5  Q9NHA4   Q9nha4 boophilus m
    23     417   18.8     407  11  Q8BYR7   Q8byr7 mus musculu
    24     416   18.7     598   5  Q9VWQ9   Q9vwq9 drosophila
    25     415   18.7     519   5  Q9VAD2   Q9vad2 drosophila
    26     414   18.7     453  11  Q8BKF6   Q8bkf6 mus musculu
    27     413   18.6     436  11  Q8VCC7   Q8vcc7 mus musculu
    28     412   18.6     374  13  Q9YHX1   Q9yhx1 gadus morhu
    29     412   18.6     411  13  Q9W6I3   Q9w6i3 gallus gall
    30     412   18.6     504   5  Q9VGX8   Q9vgx8 drosophila
    31     412   18.6     517   5  Q8T0S8   Q8t0s8 drosophila
    32   411.5   18.5     450  11  P89005   P89005 praomys nat
    33     411   18.5     402   5  Q964E5   Q964e5 dugesia tig
    34     411   18.5     436  11  Q9DBV6   Q9dbv6 mus musculu
    35   409.5   18.5     365  11  Q8BHH0   Q8bhh0 mus musculu
    36   406.5   18.3     440  11  Q925R4   Q925r4 cavia porce
    37     406   18.3     431   5  Q8T8D1   Q8t8d1 urechis uni
    38     405   18.3     429   5  P92045   P92045 lymnaea sta
    39   404.5   18.2     398   4  Q969F8   Q969f8 homo sapien
    40   403.5   18.2     422  11  Q8VHD7   Q8vhd7 rattus norv
    41   402.5   18.1     398   4  Q96QG0   Q96qg0 homo sapien
    42   400.5   18.0     396  11  Q924U1   Q924u1 rattus norv
    43   400.5   18.0     398   4  Q8NGQ8   Q8ngq8 homo sapien
    44     399   18.0     395  11  Q9Z0T7   Q9z0t7 rattus norv
    45     399   18.0     396   4  Q92492   Q92492 homo sapien
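
The "% Query Match" column in the summaries appears to be the hit score expressed as a percentage of the perfect score (2219 for this query), rounded to one decimal. The report does not state this formula, so the sketch below is an inference from the listed numbers; the function name `query_match_percent` is hypothetical.

```python
PERFECT_SCORE = 2219  # "Perfect score" from the report header


def query_match_percent(score: float, perfect_score: float = PERFECT_SCORE) -> str:
    """Format a hit score as a percentage of the perfect score (assumed formula)."""
    return f"{100.0 * score / perfect_score:.1f}"


if __name__ == "__main__":
    # Scores of results 1 to 4 in the summaries.
    for score in (2218, 1287, 976, 659.5):
        print(score, query_match_percent(score))
```

Under this assumption, result 1 rounds to 100.0 even though its score is one point below the perfect score.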



ALIGNMENTS 



RESULT 1 
Q9HBV6 

ID   Q9HBV6      PRELIMINARY;      PRT;   425 AA.
AC   Q9HBV6;
DT   01-MAR-2001 (TrEMBLrel. 16, Created)
DT   01-MAR-2001 (TrEMBLrel. 16, Last sequence update)
DT   01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE   Hypocretin receptor-1.
GN   HCRTR1.
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX   NCBI_TaxID=9606;
RN   [1]
RP   SEQUENCE FROM N.A.
RX   MEDLINE=20429525; PubMed=10973318;
RA   Peyron C., Faraco J., Rogers W., Ripley B., Overeem S., Charnay Y.,
RA   Nevsimalova S., Aldrich M., Reynolds D., Albin R., Li R., Hungs M.,
RA   Pedrazzoli M., Padigaru M., Kucherlapati M., Fan J., Maki R.,
RA   Lammers G.J., Bouras C., Kucherlapati R., Nishino S., Mignot E.;
RT   "A mutation in a case of early onset narcolepsy and a generalized
RT   absence of hypocretin peptides in human narcoleptic brains.";
RL   Nat. Med. 6:991-997(2000).
RN   [2]
RP   SEQUENCE FROM N.A.
RA   Yeager M., Welch R., Haque K., Bergen A.;
RT   "Genomic sequence of the hypocretin (orexin) receptor 1 (HCRTR1).";
RL   Submitted (DEC-2001) to the EMBL/GenBank/DDBJ databases.
RN   [3]
RP   SEQUENCE FROM N.A.
RX   MEDLINE=21580342; PubMed=11723285;
RA   Olafsdottir B.R., Rye D.B., Scammell T.E., Matheson J.K.,
RA   Stefansson K., Gulcher J.R.;
RT   "Polymorphisms in hypocretin/orexin pathway genes and narcolepsy.";
RL   Neurology 57:1896-1899(2001).
DR   EMBL; AF202084; AAG28020.1;
DR   EMBL; AF202078; AAG28020.1; JOINED.
DR   EMBL; AF202079; AAG28020.1; JOINED.
DR   EMBL; AF202080; AAG28020.1; JOINED.
DR   EMBL; AF202081; AAG28020.1; JOINED.
DR   EMBL; AF202082; AAG28020.1; JOINED.
DR   EMBL; AF202083; AAG28020.1; JOINED.
DR   EMBL; AY070269; AAL50221.1;
DR   EMBL; AY062030; AAL47214.1;
DR   GO; GO:0016021; C:integral to membrane; IEA.
DR   GO; GO:0016499; F:orexin receptor activity; IEA.
DR   GO; GO:0004872; F:receptor activity; IEA.
DR   GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.
DR   GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.
DR   InterPro; IPR000276; GPCR_Rhodpsn.
DR   InterPro; IPR004059; Orexin_receptor1.
DR   Pfam; PF00001; 7tm_1; 1.
DR   PRINTS; PR00237; GPCRRHODOPSN.
DR   PRINTS; PR01521; OREXIN1R.
DR   PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR   PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW   Receptor.
SQ   SEQUENCE   425 AA;  47535 MW;  B650B37F3A2CA096 CRC64;



Query Match 100.0%; Score 2218; DB 4; Length 425;
Best Local Similarity 99.8%; Pred. No. 6.1e-186;
Matches 424; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300

Qy        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
              |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db        361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy        421 TTVLP 425
              |||||
Db        421 TTVLP 425
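
The "Best Local Similarity" figures in these records are consistent with the number of identical positions divided by the total number of alignment columns (matches, conservative substitutions, mismatches and indel positions). This relationship is inferred from the printed statistics, not stated by the report, and the function name below is hypothetical.

```python
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> str:
    """Percent identity over all alignment columns (assumed definition)."""
    columns = matches + conservative + mismatches + indels
    return f"{100.0 * matches / columns:.1f}"


if __name__ == "__main__":
    # (Matches, Conservative, Mismatches, Indels) as printed for three hits.
    print(best_local_similarity(424, 1, 0, 0))     # Q9HBV6
    print(best_local_similarity(244, 40, 54, 10))  # Q8BV78
    print(best_local_similarity(188, 3, 8, 0))     # Q80T45
```

The same check can be used to spot OCR damage in a statistics line: if the four counts do not reproduce the printed percentage, one of them was probably misread.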



RESULT 2 
Q8BV78

ID   Q8BV78      PRELIMINARY;      PRT;   364 AA.
AC   Q8BV78;
DT   01-MAR-2003 (TrEMBLrel. 23, Created)
DT   01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT   01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE   Orexin receptor type 2.
GN   MOX2R.
OS   Mus musculus (Mouse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX   NCBI_TaxID=10090;
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=C57BL/6J; TISSUE=Hypothalamus;
RX   MEDLINE=22354683; PubMed=12466851;
RA   The FANTOM Consortium,
RA   the RIKEN Genome Exploration Research Group Phase I & II Team;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs.";
RL   Nature 420:563-573(2002).
DR   EMBL; AK079572; BAC37688.1; -.
DR   MGD; MGI:1889024; Mox2r.
DR   GO; GO:0016021; C:integral to membrane; IEA.
DR   GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.
DR   GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.
DR   InterPro; IPR000276; GPCR_Rhodpsn.
DR   Pfam; PF00001; 7tm_1; 1.
DR   PRINTS; PR00237; GPCRRHODOPSN.
DR   PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR   PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
SQ   SEQUENCE   364 AA;  42018 MW;  4EB78485DBFABDB4 CRC64;

Query Match 58.0%; Score 1287; DB 11; Length 364;
Best Local Similarity 70.1%; Pred. No. 1.8e-104;
Matches 244; Conservative 40; Mismatches 54; Indels 10; Gaps 4;

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFWALVGNTLVCLAVWRNH 75 

I I I I I : I I I I I I I I I : I I : I I : I I II I I I I : I I I I I I : I I I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFVVALIGNVLVCVAVWKNH 83 

Qy 76 HMRT VTN Y F I VN L S LADVLVT AI C L PAS LLVD I T E S WL FGHAL CKVT P YLQAVS VS VAVL 135 

I I I I I I I I I I I I I I I I II I I : I : I I I I I : I II : II I II I I I I I I I I I : I I 

Db 8 4 HMRTVTNYFIWLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 14 3 

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVS LAIMVPQAAVMECSSVLPELANR 195 

IN I I I I I I I I I I I I I : I I I I I : I I I ||: | | | | I I : I , I | | | | | | : | | | | | : 
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMIPQAIVMECSSMLPGLANK 2 03 

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I IhlMI I : : I I I : I I I I I : I I I : II I II : | | I I I I I I I I I I I I I : I : 
Db 204 TT L FT VC DEHWGGEVYP KMYH I C F FLVT YMAP L FLMI LAYLQ I FRKLWC RQ I P GT S S WQ 263 

Qy 256 RNWK--RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLP 313 

Ml : I I : I : : | | I I : I I : I I I I I I I : I I I I I I I I I I : I I I I 

Db 264 RKWKQQQPVSQ PRGSGQQSKARVSAVAAEIKQIRARRKTARMLMWLLVFAICYLP 319 



Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFL 361 

I I : I I I I I I I I I I I I II I I I I II I I I I I I I I I I : I 
Db 320 ISILNVLKRVFGMFTHTEDRETVYAWFTFPHWLVYANSCCKP NYL 364 



RESULT 3 
Q80T45 

ID   Q80T45      PRELIMINARY;      PRT;   199 AA.
AC   Q80T45;
DT   01-JUN-2003 (TrEMBLrel. 24, Created)
DT   01-JUN-2003 (TrEMBLrel. 24, Last sequence update)
DT   01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE   Orexin receptor type 1 (Fragment).
OS   Mus musculus (Mouse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX   NCBI_TaxID=10090;
RN   [1]
RP   SEQUENCE FROM N.A.
RX   MEDLINE=22584407; PubMed=12679517;
RA   Vassilatis D.K., Hohmann J.G., Zeng H., Li F., Ranchalis J.E.,
RA   Mortrud M.T., Brown A., Rodriguez S.S., Weller J.R., Wright A.C.,
RA   Bergmann J.E., Gaitanaris G.A.;
RT   "The G Protein-Coupled Receptor Repertoires of Human and Mouse.";
RL   Proc. Natl. Acad. Sci. U.S.A. 100:4903-4908(2003).
DR   EMBL; AY255599; AAO85111.1; -.
DR   GO; GO:0016021; C:integral to membrane; IEA.
DR   GO; GO:0004872; F:receptor activity; IEA.
DR   GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.
DR   GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.
DR   InterPro; IPR000276; GPCR_Rhodpsn.
DR   PRINTS; PR00237; GPCRRHODOPSN.
DR   PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW   Receptor.
FT   NON_TER       1      1
FT   NON_TER     199    199
SQ   SEQUENCE   199 AA;  22773 MW;  8190589414A81185 CRC64;

Query Match 44.0%; Score 976; DB 11; Length 199;
Best Local Similarity 94.5%; Pred. No. 1.6e-77;
Matches 188; Conservative 3; Mismatches 8; Indels 0; Gaps 0;



Qy 141 ALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFS 200

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 Ill: 1 1 1 1 1 1 1 1 II 1 I I 1 1 1 1 1 1 1

Db 1 ALDRWYAICHPLLFKSTARRARGSILGIWAVSIAVMVPQAA 60

Qy 201 VCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKR 260
INI 1 1 1: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 61 VCDEHWADELYPKIYHSCFFIWYLAPLGLMGMAYFQIFRKLWGRQIPGTTSALVRNWKR 120

Qy 261 PSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVL 320
H:ll Ml 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I I I | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 121 PSEQLEAQHQGLCTEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVL 180

Qy 321 KRVFGMFRQASDREAVYAC 339
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 181 KRVFGMFRQASDREAVYAC 199

RESULT 4 
Q8MJ13 

ID Q8MJ13 PRELIMINARY; PRT; 166 AA. 

AC Q8MJ13; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Orexin receptor 2 (Fragment).

OS Ovis aries (Sheep).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;

OC Bovidae; Caprinae; Ovis.

OX NCBI_TaxID=9940;

RN [1]

RP SEQUENCE FROM N.A.

RA Zhang S., Blackberry M.A., Blache D.;

RL Submitted (JUL-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF532967; AAM97918.1; -.

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.



DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW Receptor.

FT NON_TER 1 1

FT NON_TER 166 166

SQ SEQUENCE 166 AA; 19127 MW; DEAB9D51D14727E1 CRC64;

Query Match 29.7%; Score 659.5; DB 6; Length 166;

Best Local Similarity 78.3%; Pred. No. 6.9e-50;

Matches 123; Conservative 17; Mismatches 16; Indels 1; Gaps 1;

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNH 75
• • I I I I I • I I I I I I I I I : I I : I I : I I I I I I I t : I I I I II : I I I I I I I I I : I I
Db 9 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFVVALIGNVLVCLAVWKNH 68

Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135

I I I I I I I I I I I I I I I I I I I I I I I II : I : I I I I I : I I I : I I I I I I I I I I I I I I : I I
Db 69 HMRTVTNYFIWLSLADVLVTITCLPATLVVDITETWFFGQSLCKVIPYLQTVSVSVSVL 128

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVS 172

Ml I I I I I I I I I I I I I : I II I I : I I I ||: | | | |
Db 129 TLSCIALDRWYAICHPLMFKSTAKRARNSIVIIWIVS 165



RESULT 5 
Q8SPR4 

ID Q8SPR4 PRELIMINARY; PRT; 127 AA. 

AC Q8SPR4; 

DT 01-JUN-2002 (TrEMBLrel. 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Orexin receptor 1 (Fragment).

OS Ovis aries (Sheep).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;

OC Bovidae; Caprinae; Ovis.

OX NCBI_TaxID=9940;

RN [1]

RP SEQUENCE FROM N.A.

RA Zhang S., Blackberry M.A., Blache D.;

RL Submitted (APR-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF499612; AAM18967.1; -.

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

KW Receptor.

FT NON_TER 1 1

FT NON_TER 127 127

SQ SEQUENCE 127 AA; 14421 MW; DC5446C6BFE34417 CRC64;

Query Match 27.9%; Score 618; DB 6; Length 127;

Best Local Similarity 92.9%; Pred. No. 2.2e-46;

Matches 118; Conservative 2; Mismatches 7; Indels 0; Gaps 0;



Qy 167 GIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLA 226

Ms I I I I I I I I I I I I I I I I I M II I I I I I I I I I I | | I : | | I I I

Db 1 GIWAVSLAV1WPQAAVMECSSVLPE 60

Qy 227

M M I I I I I I I I I I II I I I I I I I I || || | Ml | | | | | | | | | | | |

Db 61

Qy 287

I I I I I I

Db 121



RESULT 6 
Q8I010 

ID Q8I010 PRELIMINARY; PRT; 109 AA. 

AC Q8I010; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Orexin receptor 1 (Fragment).

GN OX1R.

OS Bos taurus (Bovine).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;

OC Bovidae; Bovinae; Bos.

OX NCBI_TaxID=9913;

RN [1]

RP SEQUENCE FROM N.A.

RC TISSUE=Hypothalamus;

RA Watanabe K., Aso H., Sato T., Tahara K., Takano S., Yamaguchi T.;

RT "Expression of orexin receptor 1 in bovine hypothalamus.";

RL Submitted (SEP-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AB092488; BAC16765.1; -.

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

KW Receptor.

FT NON_TER 1 1

FT NON_TER 109 109

SQ SEQUENCE 109 AA; 12442 MW; E8C239847783926B CRC64;

Query Match 24.0%; Score 533; DB 6; Length 109;

Best Local Similarity 92.7%; Pred. No. 5.2e-39;

Matches 101; Conservative 2; Mismatches 6; Indels 0; Gaps 0;

Qy 169 WAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPL 228

I I N I I : I I I I I I ! II I I It I I I I I | | | | | | I II I I I I I I I I I | I | I | | | | I I I I I I I I I 
Db 1 WAVSLAVTWPQAAVT4ECSSVXPELANRTRLFSVCDE 60 



Qy 229 GLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQ 277 



M I I M M I M (Mi . Ml Ml 

Db 61 GLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSVQLEEQGQGLCTEPQ 109 

RESULT 7 
Q8BKR6 



ID Q8BKR6 PRELIMINARY; PRT; 417 AA. 

AC Q8BKR6; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Neuropeptide NPFF receptor. 

GN GPR74. 

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=C57BL/6J; TISSUE=Body;

RX MEDLINE=22354683; PubMed=12466851;

RA The FANTOM Consortium,

RA the RIKEN Genome Exploration Research Group Phase I & II Team;

RT "Analysis of the mouse transcriptome based on functional annotation of

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002).

DR EMBL; AK050939; BAC34468.1;

DR MGD; MGI:1860130; Gpr74.

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0008188; F:neuropeptide receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR InterPro; IPR005395; NPFF_receptor.

DR InterPro; IPR005397; NPFF_receptor2.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PRINTS; PR01570; NPFFRECEPTOR.

DR PRINTS; PR01572; NPFFRECEPTR2.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

SQ SEQUENCE 417 AA; 47449 MW; BB8D85EF405D5786 CRC64;

Query Match 23.1%; Score 512; DB 11; Length 417;

Best Local Similarity 32.9%; Pred. No. 1.6e-36;

Matches 121; Conservative 64; Mismatches 121; Indels 62; Gaps 7;

Qy 27 YEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIV 86

II : I : II: I I I : : I : : | | : : | | | | : | | I I I I I I I I | : I | : 
Db 27 YSDINITW-NYYLHQPQVAAVFISSYLLIFVLCMVGNTWCFIVIRNRHMHTVTNFFIL 85 

Qy 87 NLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWY 146

I I : : : I : I I I : I : I I : I I I I : : I I : : I : I | : : | | | | | : | | : 
Db 86 NLAISDLLVGIFCMPITLLDNIIAGWPFGSSMCKISGLVQGISVAASVFTLVAIAVDRFR 145 

Qy 147 AICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFS 200 

' ' I I I = I : I : I I : : : I I I | I : : | | | | | | 



Db 146 CVVYPFKPKLTVKTAFVTIVIIWGLAIAIMTPSAIMLHVQEEKYYRVRLSSHNKTST 202



Qy 201 --VCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNW 258

I I I : II : I I I I I I I : : I : I I : I : I 

Db 203 VYWCREDWPRHEMRRIYTTVLFATIYLAPLSLIVIMYARIGASLF KTAAHCTGK 256 

Qy 259 KRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLN 318

= 11 : : :: I I I I : | | : | | : | | : | 

Db 257 QRP VQWHVSKKKQKVI KMLLTVALLFILSWLPLWTLM 293 

Qy 319 VLK RVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQ 368

' I I : : : : I : I I I : II : I I I I I I : II 

Db 294 MLSDYTDLSPNKLRIINIYIYPFAHWLAFCNSSVNPIIYGFFNENFRNG 342

Qy 369 FKAAFSCC 376 

1:11 I 
Db 343 FQDAFQIC 350 



RESULT 8 
Q924H0 

ID Q924H0 PRELIMINARY; PRT; 417 AA. 

AC Q924H0; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Neuropeptide NPFF receptor. 

GN GPR74. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RA Liu Q., Guan X.-M., McDonald T.P., Jiang Q., Zeng Z., Marlene J.,

RA Williams D.L. Jr., Hong Y., Figueroa D., Clements M.K., Mallee J.,

RA Wang R., Evans J., Gould R., Austin C.P.;

RT "Identification and characterization of two cognate receptors for

RT mammalian FMRFamide-like neuropeptides.";

RL Submitted (DEC-2000) to the EMBL/GenBank/DDBJ databases.

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN (BY SIMILARITY).

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS.

DR EMBL; AF330054; AAK94198.1; -.

DR MGD; MGI:1860130; Gpr74.

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0008188; F:neuropeptide receptor activity; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR InterPro; IPR005395; NPFF_receptor.

DR InterPro; IPR005397; NPFF_receptor2.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PRINTS; PR01570; NPFFRECEPTOR.

DR PRINTS; PR01572; NPFFRECEPTR2.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Receptor; Transmembrane.

SQ SEQUENCE 417 AA; 47485 MW; AC0686514CDAF40C CRC64;

Query Match 22.9%; Score 509; DB 11; Length 417;

Best Local Similarity 32.9%; Pred. No. 3e-36;

Matches 121; Conservative 63; Mismatches 122; Indels 62; Gaps 7;

Qy 27 YEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIV 86

II : I : I I : I I I : : I : : I I : : I I I I : I I I I I II I I I I : I I : 
Db 27 YSDINITYV-NYYLHQPQVAAVFISSYLLIFVLCMVGNTWCFIVIRNRHMHTVTNFFIL 85

Qy 87 NLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWY 146

M : : I : I I I : I : I I : I I II : : I I : : I : I I : : | | | I I : I I : 
Db 86 NLATSDLLVGIFCMPITLLDNIIAGWPFGSSMCKISGLVQGISVAASVFTLVAIAVDRFR 145

Qy 147 AICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFS 200 

: : I I I : I : I : I I : : : I I I I I : : | | I I I I 

Db 146 CVVYPFKPKLTVKTAFVTIVIIWGLAIAIMTPSAIMLHVQEEKYYRVRLSSHNKTST 202

Qy 201 VCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNW 258

I I I : I I : I I I I I I I : : I : | | : | : | 

Db 203 VYWCREDWPRHEMRRIYTTVLFATIYLAPLSLIVIMYARIGASLF KTAAHCTGK 256

Qy 259 KRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLN 318

: I I : : : : I I I I : I I : I I : I I : I 

Db 257 QRPE QWHVSKKKQKVI KMLLTVALLFI LSWLPLWTLM 293 

Qy 319 VLK RVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQ 368

: I I : : : : I : I I I : I I : I I I I I I : II 

Db 294 MLSDYTDLSPNKLRIINIYIYPFAHWLAFCNSSVNPIIYGFFNENFRNG 342

Qy 369 FKAAFSCC 376 

I : I I I 
Db 343 FQDAFQIC 350 



RESULT 9 
Q924N0 

ID Q924N0 PRELIMINARY; PRT; 405 AA. 

AC Q924N0; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE G-protein-coupled receptor 74. 

GN GPR74.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Parker R.M., Herzog H.; 

RT "Molecular cloning and characterisation of GPR74 a novel G-protein 

RT coupled receptor closest related to the Y-receptor family."; 

RL Brain Res. Mol. Brain Res. 0:0-0(2000).

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN (BY SIMILARITY).

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS.

DR EMBL; AF236084; AAK58514.1;

DR MGD; MGI:1860130; Gpr74.

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0008188; F:neuropeptide receptor activity; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR InterPro; IPR005395; NPFF_receptor.

DR InterPro; IPR005397; NPFF_receptor2.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PRINTS; PR01570; NPFFRECEPTOR.

DR PRINTS; PR01572; NPFFRECEPTR2.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Receptor; Transmembrane. 

SQ SEQUENCE 405 AA; 46248 MW; 8592068E1A17D64B CRC64; 

Query Match 22.9%; Score 508.5; DB 11; Length 405; 

Best Local Similarity 32.6%; Pred. No. 3.2e-36; 

Matches 120; Conservative 65; Mismatches 120; Indels 63; Gaps 

Qy 27 YEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIV 86
II : I : I I * I I I : : I : : M : : | | | | : | | I I I I I I I I I : I :
Db 30 YSDINITYV-NYYLHQPQVAAVFISSYLLIFVXCMVGNTVVCFIVIRNRHMHTVTNFLIL 88

Qy 87 NLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWY 146
I I : : : I : I I I : | : M : I I I I : : I I : : I : I I : : I | | I I : I I :
Db 89

Qy 147 AICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFS 200

I I : I : I : I I : : : I I I I I : : I ! I I I I

Db 149

Qy 201

III : II : I : I I I I I I : : | : | | : | : |

Db 206 VYWCREDWPRHEMRRIYTTVLFAIIYLAPLSLIVIMYARIGASLF KTAAHCTGK 259

Qy 259 KRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLN 318
: I I : : :: I I II : I I : I I : I I : I

Db 260 QRPVQCM YQEKQKVI KMLLTVALLFILSWLPLWTLM 295

Qy 319 - RVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQ 368

I : :: - I : I I I : I I : I I M I I : II

Db 296 liRIINIY IYPFAHWLAFCNSSVNPIIYGFFNENFRNG 344

Qy 369

I : II I

Db 345



RESULT 10 
Q924G9 

ID Q924G9 PRELIMINARY; PRT; 432 AA.

AC Q924G9; 



DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Neuropeptide NPVF receptor. 

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116;

RN [1]

RP SEQUENCE FROM N.A.

RA Liu Q., Guan X.-M., McDonald T.P., Jiang Q., Zeng Z., Marlene J.,

RA Williams D.L. Jr., Hong Y., Figueroa D., Clements M.K., Mallee J.,

RA Wang R., Evans J., Gould R., Austin C.P.;

RT "Identification and characterization of two cognate receptors for

RT mammalian FMRFamide-like neuropeptides.";

RL Submitted (DEC-2000) to the EMBL/GenBank/DDBJ databases.

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN (BY SIMILARITY).

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS.

DR EMBL; AF330056; AAK94200.1; -.

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0008188; F:neuropeptide receptor activity; IEA.

DR GO; GO:0004872; F:receptor activity; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR InterPro; IPR005395; NPFF_receptor.

DR InterPro; IPR005396; NPFF_receptor1.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PRINTS; PR01570; NPFFRECEPTOR.

DR PRINTS; PR01571; NPFFRECEPTR1.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Receptor; Transmembrane.

SQ SEQUENCE 432 AA; 48171 MW; EAFDF052C9B7688A CRC64;

Query Match 21.6%; Score 480; DB 11; Length 432;

Best Local Similarity 35.5%; Pred. No. 1.1e-33;

Matches 121; Conservative 57; Mismatches 121; Indels 42; Gaps 8;

Qy 48 VLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVD 107

: Mill : I : : : I I I I I I | : | ! I I I I I I | | : | | : : : | : | : : | : : 

Db 45 MFIAAYVLIFLLCIVGNTLVYFIVLKNRHMRTVTNMFILNLAVSDLPVGIFCMPTTLVDN 104 

Qy 108 ITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILG 167

: I I : I II : . , : | I | : | | | | | : : | : | | | | | | : | : | 
Db 105 LITGWPFDNATCKMSGLVQGMSVSASVFTLVAIAVERFRCIVHPFREKLTLRKALFTIAV 164 

Qy 168 IWAVSLAIMVPQAAVM ECSSVLPELANRTRLFSVCDERWADDLYPKI YHSCFFIVT 223

I I I : : I I I I I : I : : I I : I | : I : I : I 

Db 165 IWAIALLIMCPSAVTLTVTREEHHFMLDARNRSYPLYSCWGAWPEKGMRKVYTAVLFAHI 224 

Qy 224 YLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAF 283 

I I I I I I : I I I I III | | : | 
Db 225 YLVPLALIVVMYVRIARKLC — QAPGPAR DTEE AV 257 



Qy 284 LAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYAC 339



: II: I I : I I I I : I I : I I : I : : | : : : | I | 

Db 258 AEGGRTSRRRARVVHMLAMVALFFTLSWLPLWVLLLLID YGELSELQLHLLSVYA- 312 

Qy 340 FTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF--SCCLP 378

I : I I I : : I : M | | | | : : | | I : I I | | 

Db 313 FPLAHWLAFFHSSANPIIYGYFNENFRRGFQAAFRAQLCWP 353 



RESULT 11 
Q7YU49 



ID Q7YU49 PRELIMINARY; PRT; 758 AA. 

AC Q7YU49;

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE RE47636p. 

GN CG10823. 

OS Drosophila melanogaster (Fruit fly).

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;

OC Ephydroidea; Drosophilidae; Drosophila.

OX NCBI_TaxID=7227;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=Berkley;

RA Stapleton M., Brokstein P., Hong L., Agbayani A., Carlson J.,

RA Champe M., Chavez C., Dorsett V., Dresnek D., Farfan D., Frise E.,

RA George R., Gonzalez M., Guarin H., Kronmiller B., Li P., Liao G.,

RA Miranda A., Mungall C.J., Nunoo J., Pacleb J., Paragas V., Park S.,

RA Patel S., Phouanenavong S., Wan K., Yu C., Lewis S.E., Rubin G.M.,

RA Celniker S.;

RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BT009988; AAQ22457.1; 

SQ SEQUENCE 758 AA; 83441 MW; A2E3BC8D566E009B CRC64; 



Query Match 21.3%; Score 473.5; DB 5; Length 758; 

Best Local Similarity 30.9%; Pred. No. 7.6e-33; 

Matches 133; Conservative 67; Mismatches 141; Indels 89; Gaps 14; 

Qy 27 YEDEFLRYLWRD YLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHH 76

MM: I I : I I I : I I : I I : I I : I I I 

Db 179 YNDSALRWEQLDGSVDFGFDPLYRHSLAMSMVYCVAYIWFLVGLIGNSFVIAWLRAPR 238 

Qy 77 MRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLT 136

I I I I I I I I I II I :: I I : I I I I I I : I : : I | : | : I | : I I : I | | | : : | : 
Db 239 MRTWNYFIWLAIADILVIVFCLPATLIGNIFVPWMLGWLMCKFVPYIQGVSVAASVYS 298 

Qy 137 LSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRT 196

I : : M I : I I II : I I I I I I : I I I : : | : | : | 

Db 299 LIAVSLDRFIAIWWPLK-QMTKRRARIMIIGIWVIALVTTIPWLLFF DLVPAE 350 

Qy 197 RLFS VCDERWADDLYPKIYHSCFFIVT-YLAPLGLMAMAYFQIFRKLW 243

: I I = I I I : I : I I I | : | : : | | : | : 

Db 351 EVFSDALVSAYSQPQFLCQEWPPGTDGNLYFLLANLVACYLLPMSLITLCYVLIWIKVS 410 

Qy 244 GRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVV 303

Db 411 TRSIPGESK DAQMDRMQQ- KS KVKVI KMLVAV 441

Qy 304 LLVFALCYLPISVLNVLKRV-FGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLS 362
I Ml: I : |:|| : | : : || : || |||:| : ::

Db 442

Qy 363 )FKAAF SCCLPGLGPCGSLK APSPRSSASHKSLSLQSRCS- 407

II M II I: : :|| | :: | : | |

Db 499 iFAAIIKSRSC CGRLRYYDNVAI AS STTSTRKS SHYHQNS SRKS PS SK 551

Qy 408

Mill

Db 552

RESULT 12 
Q9VB87 

ID Q9VB87 PRELIMINARY; PRT; 449 AA.

AC Q9VB87; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CG5811 protein. 

GN NEPYR OR CG5811. 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae; Drosophila. 

OX NCBI_TaxID=7227; 
RN [1] 

RP SEQUENCE FROM N.A.

RC STRAIN=Berkeley;

RX MEDLINE=20196006; PubMed=10731132;

RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,

RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,

RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,

RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X.,

RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D.,

RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G.,

RA Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D.,

RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M.,

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S.,

RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P.,

RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I.,

RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P.,

RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M.,

RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P.,

RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W.,

RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K.,

RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,

RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J.,

RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C.,

RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A.,

RA Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z.,

RA Lasko P., Lei Y., Levitsky A.A., Li J., Li Z., Liang Y., Lin X.,

RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D.,

RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A.,

RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L.,

RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M.,

RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G.,

RA Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H.,

RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.,

RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E.,

RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X.,

RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J.,

RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A.,

RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L.,

RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O.,

RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;

RT "The genome sequence of Drosophila melanogaster.";

RL Science 287:2185-2195(2000).

RN [2] 

RP SEQUENCE FROM N.A. 

RA Celniker S.E., Adams M.D., Kronmiller B., Wan K.H., Holt R.A.,

RA Evans C.A., Gocayne J.D., Amanatides P.G., Brandon R.C., Rogers Y.,

RA Banzon J., An H., Baldwin D., Banzon J., Beeson K.Y., Busam D.A.,

RA Carlson J.W., Center A., Champe M., Davenport L.B., Dietz S.M.,

RA Dodson K., Dorsett V., Doup L.E., Doyle C., Dresnek D., Farfan D.,

RA Ferriera S., Frise E., Galle R.F., Garg N.S., George R.A.,

RA Gonzalez M., Houck J., Hoskins R.A., Hostin D., Howland T.J.,

RA Ibegwam C., Jalali M., Kruse D., Li P., Mattei B., Moshrefi A.,

RA McIntosh T.C., Moy M., Murphy B., Nelson C., Nelson K.A., Nunoo J.,

RA Pacleb J., Paragas V., Park S., Patel S., Pfeiffer B.,

RA Phouanenavong S., Pittman G.S., Puri V., Richards S., Scheeler F.,

RA Stapleton M., Strong R., Svirskas R., Tector C., Tyler D.,

RA Williams S.M., Zaveri J.S., Smith H.O., Venter J.C., Rubin G.M.;

RT "Sequencing of Drosophila melanogaster genome.";

RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE FROM N.A. 

RA Misra S., Crosby M.A., Matthews B.B., Bayraktaroglu L., Campbell K.,

RA Hradecky P., Huang Y., Kaminker J.S., Prochnik S.E., Smith C.D.,

RA Tupy J.L., Bergman C., Berman B., Carlson J.W., Celniker S.E.,

RA Clamp M., Drysdale R., Emmert D., Frise E., de Grey A., Harris N.,

RA Kronmiller B., Marshall B., Millburn G., Richter J., Russo S.,

RA Searle S.M.J., Smith E., Shu S., Smutniak F., Whitfield E.,

RA Ashburner M., Gelbart W.M., Rubin G.M., Mungall C.J., Lewis S.E.;

RT "Annotation of Drosophila melanogaster genome.";

RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases.

RN [4]

RP SEQUENCE FROM N.A.

RA Adams M.D., Celniker S.E., Gibbs R.A., Rubin G.M., Venter C.J.;

RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases.

RN [5] 

RP SEQUENCE FROM N.A. 

RA FlyBase; 

RL Submitted (SEP-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AE003759; AAF56655.2; -.

DR FlyBase; FBgn0004842; NepYr.

DR GO; GO:0016021; C:integral to membrane; IEA.

DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

SQ SEQUENCE 449 AA; 51869 MW; FD8D0B3D70476EC1 CRC64;

Query Match 20.4%; Score 452.5; DB 5; Length 449;

Best Local Similarity 29.4%; Pred. No. 2.9e-31;

Matches 109; Conservative 64; Mismatches 137; Indels 61; Gaps 9;

Qy 26 DYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFI 85

N: || :: :: I : : I : | | : | | | | | : ||||||||| 

Db 75 DYD LLSEDMWSSAYFKII VYTVILYIPIFIFALIGNGTVCYIVYSTPRMRTVTNYFI 129 

Qy 86 VNLSLADVLVTAICLPASLL-VDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDR 144

1on :|:: |:|:: |:|:| : : | | : I Mill |: II |::|| 

Db 130 ASLAIGDILMSFFCVPSSFISLFILNYWPFGLALCHFVNYSQAVSVLVSAYTLVAISIDR 189

Qy 145 WYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDE 204

: II II -Ml I I • I : : I I : I | : : : : | | 

Db 190 YIAIMWPLKPRITKRYATFIIAGVWFIALATALPIPIVSGLDIPMSPWHTKCEKY-ICRE 248 

Qy 205 RWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQ 264 

I I I : : : | | | : : | : | : : | : : | | s | | 

Db 249 MWPSRTQEYYYTLSLFALQFWPLGVLIFTYARITIRVWAKRPPGEA ETNRDQ 301 

Qy 265 LGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVF 324

Db ™? ! |!! " |:|| ::| :| 

Db 302 RMARS KRKMVKMMLT WI VFTCCWLP FNI LQLL 334 

Qy 325 GMFRQAS DREAV YACFTFSHWLVYANSAANPII YNFLSGKFREQFKAAFS 374

H I I I I III :: Mill ::: :|| | 

Db 335 LNDEEFAHWDPLPYVWFAF-HWLAMSHCCYNPIIYCYMNARFRSGFVQLMHRMPG 388 

Qy 375 CCLPGLG 381 

III : I 

Db 389 LRRWCCLRSVG 399 



RESULT 13 
O57463

ID O57463 PRELIMINARY; PRT; 375 AA.

AC O57463;

DT 01-JUN-1998 (TrEMBLrel. 06, Created) 

DT 01-JUN-1998 (TrEMBLrel. 06, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Neuropeptide Y/peptide YY receptor Yb.

GN NPYRYB.

OS Brachydanio rerio (Zebrafish) (Danio rerio).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Cyprinidae; Danio.
OX NCBI_TaxID=7955;
RN [1]

RP SEQUENCE FROM N.A.

RA Lundell I., Berglund M.M., Starback P., Salaneck S., Gehlert D.R.,
RA Larhammar D.;
RT "Cloning and characterization of a novel neuropeptide Y (NPY) receptor
RT subtype in the zebrafish.";
RL DNA Cell Biol. 0:0-0(1997).
DR EMBL; AF030245; AAB94616.1;
DR ZFIN; ZDB-GENE-980526-208; npyryb.
DR GO; GO:0016021; C:integral to membrane; IEA.
DR GO; GO:0004872; F:receptor activity; IEA.
DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.
DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW Receptor.

SQ SEQUENCE 375 AA; 42116 MW; 4B0C9551131A14D1 CRC64;

Query Match 20.1%; Score 446; DB 13; Length 375;

Best Local Similarity 29.4%; Pred. No. 8.8e-31;

Matches 114; Conservative 70; Mismatches 136; Indels 68; Gaps 10;

Qy 49 LIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDI 108

1111 : I I I I I I : : : I I I I I I I I I I I I : | : | | : I | | : : : : 
Db 28 LIVAYS TMLAVGLVGNTCLVWITRQKEMRNVTNIFIVNLSCSDILVCLVCLPVTIIYTL 87 

Qy 109 TESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGI 168 

: I: I Mill |::| :||:|:: :: |||:| | || : | | . , 

Db 88 MDRW ILGEALCKVTPFVQCMSWVSIFSMVLIALERHQLIIHPTGWKPWRHSYLAVAVI 147 

Qy 169 WAVSLAIMVPQAAV-MECSSVLPELANRTRLFS---VCDERWADDLYPKIYHSCFFIVTY 224

I : : I : I : : : I I : II : I | : | : | : : | 

Db 148 WIIACFISLPFLSFNILTNSPFHNLSLPFNPFSDHFICIEQWPSEGNRLTYTTTLLLCQY 207

Qy 225 LAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFL 284
I I I : : I I : M : I I : | : : | | 
Db 208 CLPLALI LVCYFRI FLRLSRRK DMVERARGGRQ 240

Qy 285 AEVKQMRARRKTAKMLMWLLVFALC YL P I S VLNVLKRVFGMFRQAS DREAVYAC 339

I : : : : II : : 1111:11 III : I " : | | : | 
Db 241 KKAKGS KRVNAMLAS I VAAFALCWL P LNVFNT I FDW NHEAI PVCQHDAI 289

Qy 340 FTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSAS 396

' s 1 l :: IIMIIh I::: I: II || | :: | |: | 

Db 290 FSACHLTAMASTCVNPVIYGFLNNNFQKELKSLLSRC-RCWGPAESYES-FPLSTVSTGI 347

Qy 397 HKSLSLQSRCSV 408

II II: : I: 

Db 348 TKGSILSNGSASTYQPHKKNSLEQKESI 375 



RESULT 14 
Q9VWR3 

ID Q9VWR3 PRELIMINARY; PRT; 517 AA. 

AC Q9VWR3; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 
DE CG6857 protein. 



GN CCKLR-17D1 OR CG6857.
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
OC Ephydroidea; Drosophilidae; Drosophila.
OX NCBI_TaxID=7227;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=Berkeley;
RX MEDLINE=20196006; PubMed=10731132;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X.,
RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D.,
RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G.,
RA Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D.,
RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M.,
RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S.,
RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P.,
RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I.,
RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P.,
RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M.,
RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P.,
RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W.,
RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K.,
RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J.,
RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C.,
RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A.,
RA Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z.,
RA Lasko P., Lei Y., Levitsky A.A., Li J., Li Z., Liang Y., Lin X.,
RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D.,
RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A.,
RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L.,
RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M.,
RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G.,
RA Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E.,
RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X.,
RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J.,
RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A.,
RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L.,
RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O.,
RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).

RN [2] 

RP SEQUENCE FROM N.A. 

RA Celniker S.E., Adams M.D., Kronmiller B., Wan K.H., Holt R.A.,
RA Evans C.A., Gocayne J.D., Amanatides P.G., Brandon R.C., Rogers Y.,
RA Banzon J., An H., Baldwin D., Banzon J., Beeson K.Y., Busam D.A.,
RA Carlson J.W., Center A., Champe M., Davenport L.B., Dietz S.M.,
RA Dodson K., Dorsett V., Doup L.E., Doyle C., Dresnek D., Farfan D
RA Ferriera S., Frise E., Galle R.F., Garg N.S., George R.A.,
RA Gonzalez M., Houck J., Hoskins R.A., Hostin D., Howland T.J.,
RA Ibegwam C., Jalali M., Kruse D., Li P., Mattei B., Moshrefi A.,
RA McIntosh T.C., Moy M., Murphy B., Nelson C., Nelson K.A., Nunoo J.,
RA Pacleb J., Paragas V., Park S., Patel S., Pfeiffer B.,
RA Phouanenavong S., Pittman G.S., Puri V., Richards S., Scheeler F.,
RA Stapleton M., Strong R., Svirskas R., Tector C., Tyler D.,
RA Williams S.M., Zaveri J.S., Smith H.O., Venter J.C., Rubin G.M.;
RT "Sequencing of Drosophila melanogaster genome.";
RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE FROM N.A. 

RA Misra S., Crosby M.A., Matthews B.B., Bayraktaroglu L., Campbell K.
RA Hradecky P., Huang Y., Kaminker J.S., Prochnik S.E., Smith C.D.,
RA Tupy J.L., Bergman C., Berman B., Carlson J.W., Celniker S.E.,
RA Clamp M., Drysdale R., Emmert D., Frise E., de Grey A., Harris N
RA Kronmiller B., Marshall B., Millburn G., Richter J., Russo S
RA Searle S.M.J., Smith E., Shu S., Smutniak F., Whitfield E.
RA Ashburner M., Gelbart W.M., Rubin G.M., Mungall C.J., Lewis S.E.;
RT "Annotation of Drosophila melanogaster genome.";
RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP SEQUENCE FROM N.A.
RA Adams M.D., Celniker S.E., Gibbs R.A., Rubin G.M., Venter C.J.;
RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP SEQUENCE FROM N.A.
RA FlyBase;
RL Submitted (SEP-2002) to the EMBL/GenBank/DDBJ databases.

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN (BY SIMILARITY).
CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS.
DR EMBL; AE003509; AAF48875.2; -.
DR FlyBase; FBgn0030950; CCKLR-17D1.
DR GO; GO:0016021; C:integral to membrane; IEA.
DR GO; GO:0004872; F:receptor activity; IEA.
DR GO; GO:0001584; F:rhodopsin-like receptor activity; IEA.
DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; IEA.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Receptor; Transmembrane.
SQ SEQUENCE 517 AA; 57670 MW; 7E7BA42776E69D11 CRC64;

Query Match 20.1%; Score 446; DB 5; Length 517; 

Best Local Similarity 30.3%; Pred. No. 1.3e-30; 

Matches 132; Conservative 76; Mismatches 148; Indels 80; Gaps 18; 

Qy 1 MEPSATPGAQMGVP PGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYV 54 

I: : II : I ||:| | | : | .. , 

Db 142 MDLATT P S S S T LAP AVTVRT P GNRS WRVS ADVP IW WPCYS 183 

Qy 55 AVFWAL VGN T L VC LAVWRN H HMRT VTN Y F I VN L S LAD VL VTA I C L PAS L LVD I T E S WL F 114 

1 : : I : I I I I I I ' : I I I I : I I I : : I I : : : | : | : | : | : | : : : : | 
Db 184 AI LLCAWGNLLWLTLVQNRRMRTITNVFLLNLAI S DI LLGVFCMPVTLVGTLLRHFI F 243 

Qy 115 GHALCKVI P YLQAVS VS VAVLTLS FI ALDRW YAICHPLLFKS -- TARRARGS I LGI WAVS 172

1 I I I • I : I I | | : | : | | I : : I : | | | | | | | :: | | | , | , 

Db 244 GELLCKLIQFAQAASVAVSSWTLVAISCERYYAICHPLRSRTWQTINHANKIIAIIWLGS 303

Qy 173 LAIWPQAAVMECSSVLPELANRTRLFSVCDERW-ADDL-YPKIYHSCFF 230

1 I I I I I ::| :| I I |:| II I | : |: | : | | | 
Db 304 LVCMTPIAAF SQLMP TSRPGL-RKCREQWPADSLNYERAYN -- LFLDLALLVLPL 355

Qy 231 MAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFL^ 290

M::: :l I I :|| : : | || 
Db 356 LALSFTYLF 1 TRTL YVSMRNERA MNFGSSG L 386

Qy 291 RARRKTAKMLMWLLVFALCYLPISVLNVLKRVFGMFRQASDREAV-YACFTFSHWLVYA 349

: : : : I I I I : : I I : I : I : I : I : : | s | | | . | , , . 

Db 387 ESKKRWKMLFVLVLEFFICWTPLYVINTMTMLLG PTVYEYVGYTSISFLQLLAYS 442

Qy 350 NSAANPIIYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPS -- PRSSASHKSLSLQSR- 405

AAO : II I I I: I I || | | : : || : 

Db 443 S S CCNP I T YC FMNAS FRRAFVDT FK GMRVCERLCAPCCFWRRRSKNETNLSVAGNS 498

Qy 406 -- CSVSKISEHWLTS 419

: I : I I : I I 
Db 499 IALANSVMSSHTILES 514



RESULT 15 
Q9VRM0 
ID Q9VRM0 PRELIMINARY; PRT; 540 AA.
AC Q9VRM0;
DT 01-MAY-2000 (TrEMBLrel. 13, Created)
DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE CG10626 protein.
GN CG10626.
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
OC Ephydroidea; Drosophilidae; Drosophila.
OX NCBI_TaxID=7227;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=Berkeley;
RX MEDLINE=20196006; PubMed=10731132;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X.,
RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D.,
RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G.,
RA Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D.,
RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M.,
RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S.,
RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P.,
RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I.,
RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P.,
RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M.,
RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P.,
RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W.,
RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K.,
RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J.,
RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C.,
RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A.,
RA Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z.,
RA Lasko P., Lei Y., Levitsky A.A., Li J., Li Z., Liang Y., Lin X.,
RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D.,
RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A.,
RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L.,
RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M.,
RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G.,
RA Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E.,
RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X.,
RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J.,
RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A.,
RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L.,
RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O.,
RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN (BY SIMILARITY). 

CC -!- SIMILARITY: BELONGS TO FAMILY 1 OF G-PROTEIN COUPLED RECEPTORS. 

DR EMBL; AE003566; AAF50775.1; -. 

DR HSSP; P02699; 1F88. 

DR FlyBase; FBgn0035610; CG10626. 

DR GO; GO:0016323; C:basolateral plasma membrane; IDA.
DR GO; GO:0042071; F:leucokinin receptor activity; IDA.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Receptor; Transmembrane. 

SQ SEQUENCE 540 AA; 60612 MW; 2D22D5E1BDFD47D5 CRC64; 



Query Match 20.0%; Score 444; DB 5; Length 540; 

Best Local Similarity 27.8%; Pred. No. 2e-30; 

Matches 114; Conservative 73; Mismatches 151; Indels 72; Gaps 9 

Qy 15 PGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVTilAAWAVFWALVGNTLVCLAWRN 74 

II: I I M I II: :| | : :||::||||| | 

Db 14 PGAEE EAEFERL YAAPAE I VAL LSIFYGGISI VAVI GN T LVI WWAT T 61 

Qy 75 HHMRTVTNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAV 134 

IMIII :| II: Ml:: |:| : :|| :| |::||:||:|:| 

Db 62 RQMRTVTNMYIANLAFADVIIGLFCIPFQFQAALLQSWNLPWFMCSFCPFVQALSVNVSV 121 

Qy 135 LTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVS LAIMVPQAAVMECSSVLP 190 

I I : I I : I I I I : I I : I : : I I I I : : I | | | : 
Db 122 FTLTAIAIDRHRAIINPLRARPTKFVSKFIIGGIWMLALLFAVPFAIAFRVEELTERFRE 181 

Qy 191 --ELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIP 248 

I Mil::: : | | : : Mill : : : I | : : | | | : | 

Db 182 NNETYNVTRPFCM-NKNLSDDQLQSFRYTLVF-VQYLVPFCVISFVYIQMAVRLWGTRAP 239 



Qy 249 GTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFIAEVKQMRARRKTAKMLMWLLVF^ 308 

o,n ' . :: :: ::| |||::|:::| 
Db 240 GNAQD S RDI TLLKNKKKVI KMLIIWI I FG 2 69 

Qy 309 LCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQ 368
I I : I I : : I : I : : : I : : II : I I I I I I : | | : : 
Db 270 LCWLPLQLYNIL YVTIPEINDYHFISIVWFCCDWLAMSNSCYNPFI YGI YNEKFKRE 326 

Qy 369 FKAAFSCCLPGLGPCG SLKAPSPRSSASHKSLSLQS 4 04 

I I : 1 I : I I I I : : : I : : : I 

Db 327 FNKRFAACF CKFKTSMDAHERTFSMHTRASSIRSTYANSSMRIRS 371 



Search completed: September 28, 2004, 10:00:11 
Job time : 124 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model

Run on: September 28, 2004, 09:21:40 ; Search time 13 Seconds
(without alignments)
1702.294 Million cell updates/sec

Title: US-10-070-532-2
Perfect score: 2219
Sequence: 1 MEPSATPGAQMGVPPGSREP..........CSVSKISEHVVLTSVTTVLP 425

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 141681 seqs, 52070155 residues

Total number of hits satisfying chosen parameters: 141681

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : SwissProt_42:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result            Query
 No.  Score       Match       Length      DB  ID                  Description
   1  2214        99.8        425         1   OX1R_HUMAN          O43613 homo sapien
   2  2018.5      91.0        416         1   OX1R_RAT            P56718 rattus norv
   3  1478.5      66.6        460         1   OX2R_MOUSE          P58308 mus musculu
   4  1474.5      66.4        460         1   OX2R_RAT            P56719 rattus norv
   5  1469        66.2        444         1   OX2R_CANFA          Q9tup7 canis famil
   6  1460        65.8        444         1   OX2R_HUMAN          O43614 homo sapien
   7  1261.5      56.8        260         1   OX1R_MOUSE          P58307 mus musculu
   8  522.5       23.5        430         1   NFF1_HUMAN          Q9gzq6 homo sapien
   9  507.5       22.9        432         1   NFF1_RAT            Q9ep86 rattus norv
  10  499.5       22.5        522         1   NFF2_HUMAN          Q9y5x5 homo sapien
  11  499         22.5        417         1   NFF2_RAT            Q9eqd2 rattus norv
  12  466.5       21.0        427         1   CCKR_RABIT          O97772 oryctolagus
  13  451         20.3        382         1   NY2R_PIG            O02836 sus scrofa
  14  447.5       20.2        449         1   NYR_DROME           P25931 drosophila
  15  437         19.7        86          1   OX1R_PIG            O97661 sus scrofa
  16  435         19.6        384         1   NY2R_BOVIN          P79113 bos taurus
  17  434.5       19.6        385         1   NY2R_MOUSE          P97295 mus musculu
  18  434         19.6        [illegible] 1   [illegible]         P49683 homo sapien
  19  [illegible] [illegible] [illegible] 1   [illegible]         P49146 homo sapien
  20  [illegible] [illegible] [illegible] 1   [illegible]_RAT     [illegible] rattus norv
  21  [illegible] [illegible] [illegible] 1   [illegible]         Q9z2d5 cavia porce
  22  [illegible] [illegible] [illegible] 1   [illegible]         [illegible] gallus gall
  23  [illegible] [illegible] [illegible] 1   [illegible]         [illegible] homo sapien
  24  [illegible] [illegible] [illegible] 1   [illegible]         P70031 xenopus lae
  25  426         19.2        [illegible] 1   [illegible]         Q9gk74 macaca mula
  26  421.5       19.0        444         1   [illegible]         P30551 rattus norv
  27  [illegible] [illegible] [illegible] 1   [illegible]         O97512 oryctolagus
  28  419         18.9        [illegible] 1   [illegible]         P30547 cavia porce
  29  418.5       18.9        [illegible] 1   [illegible]         P47937 mus musculu
  30  418         18.8        407         1   [illegible]         P14600 rattus norv
  31  417.5       18.8        [illegible] 1   [illegible]         P30975 drosophila
  32  417         18.8        [illegible] 1   [illegible]         P25103 homo sapien
  33  417         18.8        [illegible] 1   [illegible]         P30548 mus musculu
  34  417         18.8        [illegible] 1   [illegible]         Q63931 cavia porce
  35  [illegible] [illegible] [illegible] 1   [illegible]_MOUSE   P56481 mus musculu
  36  [illegible] [illegible] [illegible] 1   [illegible]         Q9ttq9 canis famil
  37  416         18.7        [illegible] 1   [illegible]         P29371 homo sapien
  38  [illegible] 18.7        [illegible] 1   [illegible]         P16177 rattus norv
  39  [illegible] [illegible] [illegible] 1   NK2R_[illegible]    Q64077 cavia porce
  40  [illegible] 18.6        436         1   CCKR_MOUSE          O08786 mus musculu
  41  409.5       18.5        452         1   GASR_RABIT          P46627 oryctolagus
  42  408.5       18.4        423         1   GP83_HUMAN          Q9nym4 homo sapien
  43  406.5       18.3        450         1   GASR_PRANA          P30796 praomys nat
  44  404         18.2        346         1   GALR_RAT            Q62805 rattus norv
  45  403.5       18.2        398         1   NK2R_HUMAN          P21452 homo sapien



ALIGNMENTS 



RESULT 1 
OX1R_HUMAN
ID OX1R_HUMAN STANDARD; PRT; 425 AA.
AC O43613;
DT 30-MAY-2000 (Rel. 39, Created)
DT 30-MAY-2000 (Rel. 39, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE Orexin receptor type 1 (Ox1r) (Hypocretin receptor type 1).
GN HCRTR1.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=98150861; PubMed=9491897;
RA Sakurai T., Amemiya A., Ishii M., Matsuzaki I., Chemelli R.M.,
RA Tanaka H., Williams S.C., Richardson J.A., Kozlowski G.P., Wilson S.,
RA Arch J.R.S., Buckingham R.E., Haynes A.C., Carr S.A., Annan R.S.,
RA McNulty D.E., Liu W.-S., Terrett J.A., Elshourbagy N.A., Bergsma D.J.,
RA Yanagisawa M.;
RT "Orexins and orexin receptors: a family of hypothalamic neuropeptides
RT and G protein-coupled receptors that regulate feeding behavior.";
RL Cell 92:573-585(1998).
RN [2]
RP REVIEW.
RX MEDLINE=21237974; PubMed=11340621;
RA Hungs M., Mignot E.;
RT "Hypocretin/orexin, sleep and narcolepsy.";
RL Bioessays 23:397-408(2001).
RN [3]
RP REVIEW.
RX MEDLINE=21178476; PubMed=11283317;
RA Willie J.T., Chemelli R.M., Sinton C.M., Yanagisawa M.;
RT "To eat or to sleep? Orexin in the regulation of feeding and
RT wakefulness.";
RL Annu. Rev. Neurosci. 24:429-458(2001).

CC -!- FUNCTION: Moderately selective excitatory receptor for orexin-A
CC and, with a lower affinity, for orexin-B neuropeptide. Seems to be
CC exclusively coupled to the G(q) subclass of heteromeric G
CC proteins, which activates the phospholipase C mediated signaling
CC cascade (By similarity).
CC -!- SUBCELLULAR LOCATION: Integral membrane protein.
CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AF041243; AAC39601.1; -.
DR Genew; HGNC:4848; HCRTR1.
DR MIM; 602392; -.
DR GO; GO:0005887; C:integral to plasma membrane; TAS.
DR GO; GO:0004930; F:G-protein coupled receptor activity; TAS.
DR GO; GO:0007631; P:feeding behavior; TAS.
DR GO; GO:0007218; P:neuropeptide signaling pathway; TAS.
DR GO; GO:0007268; P:synaptic transmission; TAS.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR InterPro; IPR004059; Orexin_receptor1.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PRINTS; PR01521; OREXIN1R.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane.

FT DOMAIN 1 46 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 47 67 1 (POTENTIAL).
FT DOMAIN 68 80 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 81 102 2 (POTENTIAL).
FT DOMAIN 103 119 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 120 142 3 (POTENTIAL).
FT DOMAIN 143 164 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 165 185 4 (POTENTIAL).
FT DOMAIN 186 216 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 217 239 5 (POTENTIAL).
FT DOMAIN 240 298 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 299 321 6 (POTENTIAL).
FT DOMAIN 322 336 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 337 360 7 (POTENTIAL).
FT DOMAIN 361 425 CYTOPLASMIC (POTENTIAL).
FT CARBOHYD 194 194 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 425 AA; 47521 MW; 1634083DE10CA092 CRC64;



Query Match 99.8%; Score 2214; DB 1; Length 425; 

Best Local Similarity 99.5%; Pred. No. 5.4e-144; 

Matches 423; Conservative 1; Mismatches 1; Indels 0; Gaps 0 

Qy     1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     1 MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVA 60

Qy    61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    61 LVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCK 120

Qy   121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180

Qy   181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240

Qy   241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300
         ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db   241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRGRAFLAEVKQMRARRKTAKML 300

Qy   301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   301 MVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNF 360

Qy   361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHVVLTSV 420
         |||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||
Db   361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSISKISEHVVLTSV 420

Qy   421 TTVLP 425
         |||||
Db   421 TTVLP 425



RESULT 2 
OX1R_RAT
ID OX1R_RAT STANDARD; PRT; 416 AA.
AC P56718;
DT 30-MAY-2000 (Rel. 39, Created)
DT 30-MAY-2000 (Rel. 39, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE Orexin receptor type 1 (Ox1r) (Hypocretin receptor type 1).
GN HCRTR1.
OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.
OX NCBI_TaxID=10116;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Brain;
RX MEDLINE=98150861; PubMed=9491897;
RA Sakurai T., Amemiya A., Ishii M., Matsuzaki I., Chemelli R.M.,
RA Tanaka H., Williams S.C., Richardson J.A., Kozlowski G.P., Wilson S.,
RA Arch J.R.S., Buckingham R.E., Haynes A.C., Carr S.A., Annan R.S.,
RA McNulty D.E., Liu W.-S., Terrett J.A., Elshourbagy N.A., Bergsma D.J.,
RA Yanagisawa M.;
RT "Orexins and orexin receptors: a family of hypothalamic neuropeptides
RT and G protein-coupled receptors that regulate feeding behavior.";
RL Cell 92:573-585(1998).
RN [2]
RP REVIEW.
RX MEDLINE=21237974; PubMed=11340621;
RA Hungs M., Mignot E.;
RT "Hypocretin/orexin, sleep and narcolepsy.";
RL Bioessays 23:397-408(2001).
RN [3]
RP REVIEW.
RX MEDLINE=21178476; PubMed=11283317;
RA Willie J.T., Chemelli R.M., Sinton C.M., Yanagisawa M.;
RT "To eat or to sleep? Orexin in the regulation of feeding and
RT wakefulness.";
RL Annu. Rev. Neurosci. 24:429-458(2001).

CC -!- FUNCTION: Moderately selective excitatory receptor for orexin-A 
CC and, with a lower affinity, for orexin-B neuropeptide. Seems to be 

CC exclusively coupled to the G(q) subclass of heteromeric G 

CC proteins, which activates the phospholipase C mediated signaling 

CC cascade. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Highly expressed in the brain in the 

CC prefrontal cortex, hippocampus, paraventricular thalamus, 

CC ventromedial hypothalamus, arcuate nucleus, dorsal raphe nucleus, 

CC and locus coeruleus. Not detected in the spleen, lung, liver,
CC skeletal muscle, kidney and testis. Orexin receptor mRNA
CC expression has also been reported in the adrenal gland, enteric
CC nervous system, and pancreas.
CC -!- INDUCTION: By nutritional state, up-regulated by fasting.
CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AF041244; AAC40041.1; -.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR InterPro; IPR004059; Orexin_receptor1.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PRINTS; PR01521; OREXIN1R.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane. 



FT DOMAIN 1 46 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 47 67 1 (POTENTIAL).
FT DOMAIN 68 80 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 81 102 2 (POTENTIAL).
FT DOMAIN 103 119 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 120 142 3 (POTENTIAL).
FT DOMAIN 143 164 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 165 185 4 (POTENTIAL).
FT DOMAIN 186 216 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 217 239 5 (POTENTIAL).
FT DOMAIN 240 298 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 299 321 6 (POTENTIAL).
FT DOMAIN 322 336 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 337 360 7 (POTENTIAL).
FT DOMAIN 361 416 CYTOPLASMIC (POTENTIAL).
FT CARBOHYD 194 194 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 416 AA; 46799 MW; 774DE7A22EA05D18 CRC64;

Query Match 91.0%; Score 2018.5; DB 1; Length 416;

Best Local Similarity 92.0%; Pred. No. 1.1e-130;

Matches 390; Conservative 8; Mismatches 17; Indels 9; Gaps 1 

Qy 1 ME P SAT P GAQMGVP PGS REP S PVP P DYEDEFLRYLWRDYL YPKQ YEWVLI AAYVAVFVVA 60 

I I I I I I I I I I III I I I : I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I :: I 
Db 1 MEPSATPGAQPGVPTSSGEPFHLPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFLIA 60 

Qy 61 LVGNTLVCLAWRNHHMRTWNYFIWLSLADVXWAICLPASLLVDITESWLFGHALCK 12 0 

M I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 61 LVGNTLVCIAWRNHHMRTWNYFIWLSLADVLVTAICLPASLLVDITESWLFGHALCK 12 0 

Qy 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQA 180 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | || | | | | | | | M I : I II I I 
Db 121 VIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAVMVPQA 180 

Qy 181 AVMECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFR 240 

I I I I I I I I I M I I I I I I I I I I I I I I I I I : I I I I I I II I I I I I I I I I I I I I II I I I I I I 
Db 181 AVWECSSVLPELANRTRLFSVCDERWADELYPKIYHSCFFFVTYLAPLGLMGMAYFQIFR 240 

Qy 241 KLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKML 300 

I I I I M I I I I I I I I I I I I I I I : I I III I I I II I I M I I I I II I I I I I I I I I I I 

Db 241 KLWGPQIPGTTSALVRNWKRPSEQLEAQHQGLCTEPQPRARAFLAEVKQMRARRKTAKML 300 

Qy 301 MWLL VFAL C YL P I S VLNVLKRVFGMFRQAS DREAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

Db 301 MWL LVFALC YL P I S VLNVL KRVFGMFRQAS D REAVYAC FT FS HWLVYAN S AAN P 1 1 YN F 360 

Qy 361 LSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSRCSVSKISEHWLTSV 420 

M I I I 1 I II I I I I I I I I I I I I I III I I I I I I I I I I I I I I : I I I I I I I : I 

Db 361 LSGKFREQFKAAFSCCLPGLG---------PSSSARHKSLSLQSRCSVSKVSEHVVLTTV 411

Qy 421 TTVL 424 

I I I I 

Db 412 TTVL 415 



RESULT 3 
OX2R_MOUSE

ID OX2R_MOUSE STANDARD; PRT; 460 AA.

AC P58308; Q8BG12; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Orexin receptor type 2 (Ox2r) (Hypocretin receptor type 2).

GN HCRTR2 OR MOX2R.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Cerebellum, and Hypothalamus; 

RX MEDLINE=22354683; PubMed=12466851;

RA Okazaki Y., Furuno M., Kasukawa T., Adachi J., Bono H., Kondo S.,

RA Nikaido I., Osato N., Saito R., Suzuki H., Yamanaka I., Kiyosawa H.,

RA Yagi K., Tomaru Y., Hasegawa Y., Nogami A., Schonbach C., Gojobori T.,

RA Baldarelli R., Hill D.P., Bult C., Hume D.A., Quackenbush J.,

RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W.,

RA Blake J.A., Bradt D., Brusic V., Chothia C., Corbani L.E., Cousins S.,

RA Dalla E., Dragani T.A., Fletcher C.F., Forrest A., Frazer K.S.,

RA Gaasterland T., Gariboldi M., Gissi C., Godzik A., Gough J.,

RA Grimmond S., Gustincich S., Hirokawa N., Jackson I.J., Jarvis E.D.,

RA Kanai A., Kawaji H., Kawasawa Y., Kedzierski R.M., King B.L.,

RA Konagaya A., Kurochkin I.V., Lee Y., Lenhard B., Lyons P.A.,

RA Maglott D.R., Maltais L., Marchionni L., McKenzie L., Miki H.,

RA Nagashima T., Numata K., Okido T., Pavan W.J., Pertea G., Pesole G.,

RA Petrovsky N., Pillai R., Pontius J.U., Qi D., Ramachandran S.,

RA Ravasi T., Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M.,

RA Sandelin A., Schneider C., Semple C.A., Setou M., Shimada K.,

RA Sultana R., Takenaka Y., Taylor M.S., Teasdale R.D., Tomita M.,

RA Verardo R., Wagner L., Wahlestedt C., Wang Y., Watanabe Y., Wells C.,

RA Wilming L.G., Wynshaw-Boris A., Yanagisawa M., Yang I., Yang L.,

RA Yuan Z., Zavolan M., Zhu Y., Zimmer A., Carninci P., Hayatsu N.,

RA Hirozane-Kishikawa T., Konno H., Nakamura M., Sakazume N., Sato K.,

RA Shiraki T., Waki K., Kawai J., Aizawa K., Arakawa T., Fukuda S.,

RA Hara A., Hashizume W., Imotani K., Ishii Y., Itoh M., Kagawa I.,

RA Miyazaki A., Sakai K., Sasaki D., Shibata K., Shinagawa A.,

RA Yasunishi A., Yoshino M., Waterston R., Lander E.S., Rogers J.,

RA Birney E., Hayashizaki Y.;

RT "Analysis of the mouse transcriptome based on functional annotation of

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002). 

RN [2] 

RP SEQUENCE OF 100-311 FROM N.A. 

RC STRAIN=C57BL/6; 

RA Szendro P.I., Maevers K., Eichele G.;

RT "Cloning of mouse orexin receptors.";

RL Submitted (JUN-2001) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed=11340621;

RA Hungs M., Mignot E.;

RT "Hypocretin/orexin, sleep and narcolepsy.";

RL Bioessays 23:397-408(2001).

RN [4]

RP REVIEW.

RX MEDLINE=21178476; PubMed=11283317;

RA Willie J.T., Chemelli R.M., Sinton C.M., Yanagisawa M.;

RT "To eat or to sleep? Orexin in the regulation of feeding and

RT wakefulness.";

RL Annu. Rev. Neurosci. 24:429-458(2001).

CC -!- FUNCTION: Nonselective, high-affinity receptor for both orexin-A 
CC and orexin-B neuropeptides.

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AK038551; BAC30039.1; -. 

DR EMBL; AK048781; BAC33457.1; -. 

DR EMBL; AF394597; AAK71327.1; -. 

DR MGD; MGI:1889024; Mox2r.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR InterPro; IPR004060; Orexin_receptor2.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.



DR PRINTS; PR01522; OREXIN2R.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein.

FT CONFLICT 201 201 A -> T (IN REF. 2).

FT CONFLICT 240 240 I -> V (IN REF. 2).

SQ SEQUENCE 460 AA; 52461 MW; D62A67C15BA67DCC CRC64;



Query Match 66.6%; Score 1478.5; DB 1; Length 460; 

Best Local Similarity 69.7%; Pred. No. 7.4e-94;

Matches 287; Conservative 48; Mismatches 68; Indels 9; Gaps 4; 



Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNH 75

••II I H = l IMIIII|:|t:||:|||llll |: I I I I t I =

Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAWKNH 83



Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135

IN MINIM M M : I : :| M : M M M M I MMMM 

Db 84 HMRTVTNYFIVNLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195

Ml M M M M : M I M : M I M : M M M : M I M M M : M MM 

Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMIPQAIVMECSSMLPGLANK 203 

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255

I M : M M I : : I I I : II II I : I I I : I I I I I : I I Mlllll I M II I : I : 
Db 204 TTLFTVCDEHWGGEWPKMYHICFFLVTYMAPLCLMILAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWK--RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLP 313

IN : I I M : : I I I I : I I : I I I I I I I : I | MMMM 

Db 264 RKWKQQQPVSQ PRGSGQQS KARI SAVAAEI KQI RARRKTARMLMVVLLVFAI C YLP 319 

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373

M : I I I II I I I I II III II I II I I I II I I I II I I M I I I Ill: 

Db 320 ISILNVLKRVFGMFTHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAF 379 

Qy 374 SCCLPGLGPCGSLKAPSPRSSASHKSLSLQ--SRCSVSKISEHVVLTSVTTV 423

MM I I I : I II I : I : : I I I : I I I I I I I I : : I : 

Db 380 SCCLGVHHRQGDRLARGRTSTESRKSLTTQISNFDNVSKLSEHWLTSISTL 431 



RESULT 4 
OX2R_RAT 

ID OX2R_RAT STANDARD; PRT; 460 AA.

AC P56719; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Orexin receptor type 2 (Ox2r) (Hypocretin receptor type 2).

GN HCRTR2.

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=98150861; PubMed=9491897;

RA Sakurai T., Amemiya A., Ishii M., Matsuzaki I., Chemelli R.M.,

RA Tanaka H., Williams S.C., Richardson J.A., Kozlowski G.P., Wilson S.,

RA Arch J.R.S., Buckingham R.E., Haynes A.C., Carr S.A., Annan R.S.,

RA McNulty D.E., Liu W.-S., Terrett J.A., Elshourbagy N.A., Bergsma D.J.,

RA Yanagisawa M.;

RT "Orexins and orexin receptors: a family of hypothalamic neuropeptides 

RT and G protein-coupled receptors that regulate feeding behavior."; 

RL Cell 92:573-585(1998). 

RN [2] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed=11340621;

RA Hungs M., Mignot E.;

RT "Hypocretin/orexin, sleep and narcolepsy.";

RL Bioessays 23:397-408(2001).

RN [3]

RP REVIEW.

RX MEDLINE=21178476; PubMed=11283317;

RA Willie J.T., Chemelli R.M., Sinton C.M., Yanagisawa M.;

RT "To eat or to sleep? Orexin in the regulation of feeding and

RT wakefulness.";

RL Annu. Rev. Neurosci. 24:429-458(2001).

CC -!- FUNCTION: Nonselective, high-affinity receptor for both orexin-A 
CC and orexin-B neuropeptides. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Expressed in the brain in the cerebral 

CC cortex, septal nuclei, hippocampus, medial thalamic groups, dorsal 

CC and median raphe nuclei, and many hypothalamic nuclei including

CC the tuberomammillary nucleus, dorsomedial hypothalamus,

CC paraventricular hypothalamic nucleus, and ventral premammillary

CC nucleus. Not detected in the spleen, lung, liver, skeletal muscle,

CC kidney and testis. Orexin receptor mRNA expression has also been

CC reported in the adrenal gland, enteric nervous system, and

CC pancreas.

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF041246; AAC40042.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR InterPro; IPR004060; Orexin_receptor2.

DR Pfam; PF00001; 7tm_1; 1.

DR Pfam; PF03827; Orexin_rec2; 1.



DR PRINTS; PR00237; GPCRRHODOPSN.

DR PRINTS; PR01522; OREXIN2R.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein.

FT CARBOHYD 14 14 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 22 22 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 202 202 N-LINKED (GLCNAC...) (POTENTIAL).

SQ SEQUENCE 460 AA; 52489 MW; 3B44E3D82F8B85D5 CRC64;

Query Match 66.4%; Score 1474.5; DB 1; Length 460; 

Best Local Similarity 69.8%; Pred. No. 1.4e-93; 

Matches 286; Conservative 48; Mismatches 71; Indels 5; Gaps 3;

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNH 75

: : I I I I I : I I I I I I I I I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I | : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALIGNVLVCVAVWKNH 83

Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135

I I I I I I I I I II I I II I I I I I I M M : | : | | | | | : | | | : I f I i I I | | | I I I I : I I 
Db 84 HMRTVTNYFIWLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143 

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195 

Ml I I I I I I I I I I I I I : I I I I I : I I I II: II II I I : I I I I II I I : I I III: 
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIWIWIVSCIIMIPQAIVMERSSMLPGLANK 203 

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I 11:111111 : : I I I : I I I I : I I I : I I I || : I I I I I I I I I I II I I I : I : 
Db 204 TTLFTVCDERWGGEWPKMYHICFFLWYMAPLCLMVXAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPIS 315

I 11 = 1 I : I : : I I II : I I : I I I I I I I : I I I I I I I I I I : I I I I I I 

Db 264 RKWKQP--QPVSQPRGSGQQSKARISAVAAEIKQIRARRKTARMLMVVLLVFAICYLPIS 321

Qy 316 VLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSC 375 

• I I I I I I I I I I I I M III I I I II I I | | | | M | | | | | | | | | | | | | | | : | | | | | | | 

Db 322 ILNVLKRVFGMFTHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAFSC 381 

Qy 376 CLPGLGPCGSLKAPSPRSSASHKSLSLQ--SRCSVSKISEHVVLTSVTTV 423

I I I I I : I III: I : :l I 1:1 I I I I I I ::| : 

Db 382 CLGVHRRQGDRLARGRTSTESRKSLTTQISNFDNVSKLSEHVALTSISTL 431 



RESULT 5 
OX2R_CANFA 

ID OX2R_CANFA STANDARD; PRT; 444 AA. 

AC Q9TUP7; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Orexin receptor type 2 (Ox2r) (Hypocretin receptor type 2).

GN HCRTR2.

OS Canis familiaris (Dog).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99385793; PubMed=10458611;

RA Lin L., Faraco J., Li R., Kadotani H., Rogers W., Lin X., Qiu X.,

RA de Jong P.J., Nishino S., Mignot E.;

RT "The sleep disorder canine narcolepsy is caused by a mutation in the 



RT hypocretin receptor 2 gene."; 

RL Cell 98:365-376(1999). 

RN [2] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed=11340621;

RA Hungs M., Mignot E.;

RT "Hypocretin/orexin, sleep and narcolepsy.";

RL Bioessays 23:397-408(2001).

RN [3] 

RP REVIEW. 

RX MEDLINE=21178476; PubMed=11283317 ; 

RA Willie J.T., Chemelli R.M., Sinton C.M., Yanagisawa M.;

RT "To eat or to sleep? Orexin in the regulation of feeding and 

RT wakefulness."; 

RL Annu. Rev. Neurosci. 24:429-458(2001). 

RN [4] 

RP VARIANT NARCOLEPSY LYS-54, AND MUTAGENESIS OF GLU-54. 

RX MEDLINE=21180003; PubMed=11282968;

RA Hungs M., Fan J., Lin L., Lin X., Maki R.A., Mignot E.;

RT "Identification and functional analysis of mutations in the hypocretin 

RT (orexin) genes of narcoleptic canines."; 

RL Genome Res. 11:531-539(2001). 

CC -!- FUNCTION: Nonselective, high-affinity receptor for both orexin-A 
CC and orexin-B neuropeptides. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- DISEASE: Defects in HCRTR2 are a cause of an autosomal recessive 

CC form of narcolepsy, observed in labradors, dobermans and 

CC dachshunds. Narcolepsy is a neurological sleep disorder affecting 

CC animals and humans, characterized by excessive daytime sleepiness, 

CC sleep fragmentation, symptoms of abnormal rapid-eye-movement

CC (REM) sleep, such as cataplexy, hypnagogic hallucinations, and 

CC sleep paralysis. Cataplexy is a sudden loss of muscle tone 

CC triggered by emotions, which is the most valuable clinical feature 

CC used to diagnose narcolepsy. As in humans, most cases of canine 

CC narcolepsy are sporadic but an autosomal recessive form was also 

CC observed. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AF164626; AAD49333.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR InterPro; IPR004060; Orexin_receptor2.

DR Pfam; PF00001; 7tm_1; 1.

DR Pfam; PF03827; Orexin_rec2; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PRINTS; PR01522; OREXIN2R.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein; 

KW Disease mutation. 
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54 


54 
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SQ 


SEQUENCE 


444 AA; 
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MW; D848A4536D485D6B CRC64; 



Query Match 66.2%; Score 1469; DB 1; Length 444; 

Best Local Similarity 69.2%; Pred. No. 3.2e-93; 

Matches 286; Conservative 51; Mismatches 66; Indels 10; Gaps 5;

Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNH 75

-II I I I : I I I I I I I I I : I I : I I : I I I I I I I I : I I I I I I I I I I I I : I I I : I I 
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFWALVGNVLVCVAVWKNH 83 

Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135

I I I I I I I I I I I I I I II I I I I I I I I I : I : I I I I I : I I I : I I I I I I I I I Mill:!! 
Db 84 HMRTWNYFIWLSLADVLVTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVL 143

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195

Ml I I I I I I I I I I I I I : I I I I I : I I I II: II II I I : I I I I I I I I :: I I III: 
Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIVIIWIVSCIIMIPQAIVMECSTMLPGLANK 203

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255 

I I I : I I II II : : I I I : I I I I I : I I I : I I I II : I I I I I I I I I I I I I I I : I : 
Db 204 TTLFTVCDERWGGEIYPKMYHICFFLVTYMAPLCLMVLAYLQIFRKLWCRQIPGTSSWQ 263 

Qy 256 RNWK--RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLP 313

Ml : I : I =1 s : I I I I : I I : I I I I I I I : I I I I I : I I I I 

Db 264 RKWKPLQPASQ----FRGPGQQTKSRISAVAAEIKQIRARRKTARMLMVVLLVFAICYLP 319

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373

M : I I I I I I I I I I I III III I I I I I I I I I II I II | I I | | | | | | | I I | | : | | | | | 

Db 320 ISILNVLKRVFGMFTHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAF 379

Qy 374 SCCLPGLGPCGSLKAPSPRSSA-SHKSLSLQ--SRCSVSKISEHVVLTSVTTV 423

Ml I : : I : I I I I I : I : : I I I : I I I I I I I : : I : 

Db 380 SCCCLGVHHRQEDRLTRGRTSTESRKSLTTQISNFDNVSKLSEQWLTSISTL 432 



RESULT 6 



OX2R_HUMAN

ID OX2R_HUMAN STANDARD; PRT; 444 AA.

AC O43614;

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Orexin receptor type 2 (Ox2r) (Hypocretin receptor type 2). 

GN HCRTR2. 

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98150861; PubMed=9491897;

RA Sakurai T., Amemiya A., Ishii M., Matsuzaki I., Chemelli R.M.,

RA Tanaka H., Williams S.C., Richardson J.A., Kozlowski G.P., Wilson S.,

RA Arch J.R.S., Buckingham R.E., Haynes A.C., Carr S.A., Annan R.S.,

RA McNulty D.E., Liu W.-S., Terrett J.A., Elshourbagy N.A., Bergsma D.J.,

RA Yanagisawa M.;

RT "Orexins and orexin receptors: a family of hypothalamic neuropeptides 

RT and G protein-coupled receptors that regulate feeding behavior."; 

RL Cell 92:573-585(1998). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21580342; PubMed=11723285;

RA Olafsdottir B.R., Rye D.B., Scammell T.E., Matheson J.K.,

RA Stefansson K., Gulcher J.R.;

RT "Polymorphisms in hypocretin/orexin pathway genes and narcolepsy.";

RL Neurology 57:1896-1899(2001).

RN [3] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed=11340621;

RA Hungs M., Mignot E.;

RT "Hypocretin/orexin, sleep and narcolepsy."; 

RL Bioessays 23:397-408(2001). 

RN [4] 

RP REVIEW. 

RX MEDLINE=21178476; PubMed=11283317;

RA Willie J.T., Chemelli R.M., Sinton C.M., Yanagisawa M.;

RT "To eat or to sleep? Orexin in the regulation of feeding and 

RT wakefulness."; 

RL Annu. Rev. Neurosci. 24:429-458(2001). 

CC -!- FUNCTION: Nonselective, high-affinity receptor for both orexin-A 
CC and orexin-B neuropeptides.

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF041245; AAC39602.1; -.

DR EMBL; AY062031; AAL47215.1; -.

DR Genew; HGNC:4849; HCRTR2.

DR MIM; 602393; -.

DR GO; GO:0005887; C:integral to plasma membrane; TAS.

DR GO; GO:0008188; F:neuropeptide receptor activity; TAS.

DR GO; GO:0007631; P:feeding behavior; TAS.

DR GO; GO:0007218; P:neuropeptide signaling pathway; TAS.

DR GO; GO:0007268; P:synaptic transmission; TAS.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR InterPro; IPR004060; Orexin_receptor2.

DR Pfam; PF00001; 7tm_1; 1.

DR Pfam; PF03827; Orexin_rec2; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PRINTS; PR01522; OREXIN2R.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein. 



FT DOMAIN 1 54 EXTRACELLULAR (POTENTIAL).

FT TRANSMEM 55 75 1 (POTENTIAL).

FT DOMAIN 76 88 CYTOPLASMIC (POTENTIAL).

FT TRANSMEM 89 110 2 (POTENTIAL).

FT DOMAIN 111 127 EXTRACELLULAR (POTENTIAL).

FT TRANSMEM 128 150 3 (POTENTIAL).

FT DOMAIN 151 172 CYTOPLASMIC (POTENTIAL).

FT TRANSMEM 173 193 4 (POTENTIAL).

FT DOMAIN 194 224 EXTRACELLULAR (POTENTIAL).

FT TRANSMEM 225 247 5 (POTENTIAL).

FT DOMAIN 248 304 CYTOPLASMIC (POTENTIAL).

FT TRANSMEM 305 327 6 (POTENTIAL).

FT DOMAIN 328 342 EXTRACELLULAR (POTENTIAL).

FT TRANSMEM 343 366 7 (POTENTIAL).

FT DOMAIN 367 444 CYTOPLASMIC (POTENTIAL).

FT CARBOHYD 14 14 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 22 22 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 202 202 N-LINKED (GLCNAC...) (POTENTIAL).

SQ SEQUENCE 444 AA; 5068:0 MW; CA0669F0D4224C65 CRC64;


Query Match 65.8%; Score 1460; DB 1; Length 444;

Best Local Similarity 68.8%; Pred. No. 1.3e-92;



Matches 284; Conservative 50; Mismatches 69; Indels 10; Gaps 



Qy 17 SREPSPVPPDYED-EFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNH 75

: : I I I I I : I I I I I I II I : I I : I I : I I I I I I I I : I I I I I I : I I I I I : I I I : I I
Db 24 TQEPFLNPTDYDDEEFLRYLWREYLHPKEYEWVLIAGYIIVFVVALIGNVLVCVAVWKNH 83

Qy 76 HMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVL 135

I I I I II I I I I I I I I I I I I I I I I I t I : I : I I I I I : I M : I I I I I I I I I I I I I I : I I

Db 84 HMRTVTNYFIVNLSLADVLVTITCLPATLVVDITETWFFGQSLCKVIPYLQTVSVSVSVL 143

Qy 136 TLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANR 195

III I I I I I I I I I I I I I : I I I I I : I II II: II II I I : I I I I I I I I : I I III:

Db 144 TLSCIALDRWYAICHPLMFKSTAKRARNSIVIIWIVSCIIMIPQAIVMECSTVFPGLANK 203

Qy 196 TRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALV 255

I I I : I I I I I I : : II I : I I I I I : I I I : I I I II : I I I I I I I I I I I I I I I : I :

Db 204 TTLFTVCDERWGGEIYPKMYHICFFLVTYMAPLCLMVLAYLQIFRKLWCRQIPGTSSVVQ 263



Qy 256 RNWK--RPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLP 313

Ml : I I : I : I I I I : I I : I I I I I II : I I I I I I I I I I : | t I I 

Db 264 RKWKPLQPVSQ----PRGPGQPTKSRMSAVAAEIKQIRARRKTARMLMVVLLVFAICYLP 319

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373

I I : I M I I I I I I I I III III I I I I I II I I II | I | | M | | | | | | | | | | | : | | | | | 

Db 320 ISILNVLKRVFGMFAHTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAF 379

Qy 374 SCCLPGLGPCGSLKAPSPRSSA-SHKSLSLQ--SRCSVSKISEHVVLTSVTTV 423

M I I : : I : I I I I I : I : : : II : I I I I I I I : : I : 

Db 380 SCCCLGVHHRQEDRLTRGRTSTESRKSLTTQISNFDNISKLSEQWLTSISTL 432 



RESULT 7 
OX1R_MOUSE

ID OX1R_MOUSE STANDARD; PRT; 260 AA.

AC P58307; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Orexin receptor type 1 (Ox1r) (Hypocretin receptor type 1) (Fragment).

GN HCRTR1.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; 

RA Szendro P.I., Maevers K., Eichele G.;

RT "Cloning of mouse orexin receptors.";

RL Submitted (JUN-2001) to the EMBL/GenBank/DDBJ databases.

RN [2] 

RP REVIEW. 

RX MEDLINE=21237974; PubMed=11340621;

RA Hungs M., Mignot E.;

RT "Hypocretin/orexin, sleep and narcolepsy.";

RL Bioessays 23:397-408(2001).

RN [3] 

RP REVIEW. 

RX MEDLINE=21178476; PubMed=11283317;

RA Willie J.T., Chemelli R.M., Sinton C.M., Yanagisawa M.;

RT "To eat or to sleep? Orexin in the regulation of feeding and

RT wakefulness.";

RL Annu. Rev. Neurosci. 24:429-458(2001).

CC -!- FUNCTION: Moderately selective excitatory receptor for orexin-A 
CC and, with a lower affinity, for orexin-B neuropeptide. Seems to be 

CC exclusively coupled to the G(q) subclass of heteromeric G 

CC proteins, which activates the phospholipase C mediated signaling 

CC cascade (By similarity).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF394596; AAK71326.1; -.

DR MGD; MGI:2385650; Hcrtr1.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR InterPro; IPR004059; Orexin_receptor1.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PRINTS; PR01521; OREXIN1R.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; FALSE_NEG.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane. 

FT NON_TER 1 1 

FT TRANSMEM <1 9 3 (POTENTIAL).

FT DOMAIN 10 31 CYTOPLASMIC (POTENTIAL).

FT TRANSMEM 32 52 4 (POTENTIAL).

FT DOMAIN 53 83 EXTRACELLULAR (POTENTIAL).

FT TRANSMEM 84 106 5 (POTENTIAL).

FT DOMAIN 107 165 CYTOPLASMIC (POTENTIAL).

FT TRANSMEM 166 188 6 (POTENTIAL).

FT DOMAIN 189 203 EXTRACELLULAR (POTENTIAL).

FT TRANSMEM 204 227 7 (POTENTIAL).

FT DOMAIN 228 >260 CYTOPLASMIC (POTENTIAL).

FT CARBOHYD 61 61 N-LINKED (GLCNAC...) (POTENTIAL).

FT NON_TER 260 260

SQ SEQUENCE 260 AA; 29419 MW; AC4E67176AFC662C CRC64; 

Query Match 56.8%; Score 1261.5; DB 1; Length 260; 

Best Local Similarity 91.4%; Pred. No. 2.6e-79; 

Matches 246; Conservative 3; Mismatches 11; Indels 9; Gaps 1; 

Qy 134 VLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELA 193

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I : I I I I I I I | | | | | | | | | | | 

Db 1 VLTLSVIALDRWYAICHPLLFKSTARRARGSILGIWAVS^VWPQAAVMECSSVLPELA 60 

Qy 194 NRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSA 253 

I I I I I I I I I : I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 61 NRTRLFSVCDEHWADELYPKIYHSCFFIVTYLAPLGLMGMAYFQIFRKLWGRQIPGTTSA 120

Qy 254 LVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLP 313

111111111:11 III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I 

Db 121 LVRNWKRPSEQLEAQHQGLCTEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLP 180

Qy 314 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 373

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 ISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAF 240

Qy 374 SCCLPGLGPCGSLKAPSPRSSASHKSLSL 402

I I I I I I I I I I I I I I I I I I 

Db 241 SCCLPGLG---------PGSSARHKSLSL 260



RESULT 8 
NFF1_HUMAN 

ID NFF1_HUMAN STANDARD; PRT; 430 AA. 



AC Q9GZQ6; Q8NGR0; Q96RN3;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Neuropeptide FF receptor 1 (RFamide-related peptide receptor OT7T022).

GN NPFF1.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Spinal cord; 

RX MEDLINE=20564301; PubMed=11024015;

RA Bonini J.A., Jones K.A., Adham N., Forray C., Artymyshyn R.,

RA Durkin M.M., Smith K.E., Tamm J.A., Boteju L.W., Lakhlani P.P.,

RA Raddatz R., Yao W.-J., Ogozalek K.L., Boyle N., Kouranova E.V.,

RA Quan Y., Vaysse P.J., Wetzel J.M., Branchek T.A., Gerald C.,

RA Borowsky B.;

RT "Identification and characterization of two G protein-coupled

RT receptors for neuropeptide FF.";

RL J. Biol. Chem. 275:39324-39331(2000).

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20482175; PubMed=11025660;

RA Hinuma S., Shintani Y., Fukusumi S., Iijima N., Matsumoto Y.,

RA Hosoya M., Fujii R., Watanabe T., Kikuchi K., Terao Y., Yano T.,

RA Yamamoto T., Kawamata Y., Habata Y., Asada M., Kitada C., Kurokawa T.,

RA Onda H., Nishimura O., Tanaka M., Ibata Y., Fujino M.;

RT "New neuropeptides containing carboxy-terminal RFamide and their 

RT receptor in mammals."; 

RL Nat. Cell Biol. 2:703-708(2000). 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Liu Q., Guan X.-M., McDonald T.P., Jiang Q., Zeng Z., Marlene J.,

RA Williams D.L. Jr., Hong Y., Figueroa D., Clements M.K., Mallee J.,

RA Wang R., Evans J., Gould R., Austin C.P.;

RT "Identification and characterization of two cognate receptors for

RT mammalian FMRFamide-like neuropeptides.";

RL Submitted (DEC-2000) to the EMBL/GenBank/DDBJ databases.

RN [4] 

RP SEQUENCE FROM N.A. 

RA Suwa M., Sato T., Okouchi I., Arita M., Futami K., Matsumoto S., 

RA Tsutsumi S., Aburatani H., Asai K., Akiyama Y. ; 

RT "Genome-wide discovery and analysis of human seven transmembrane helix 

RT receptor genes.";

RL Submitted (JUL-2001) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: Receptor for NPAF (A-18-F-amide) and NPFF (F-8-F-amide)
CC neuropeptides, also known as morphine-modulating peptides. Can

CC also be activated by a variety of naturally occurring or synthetic

CC FMRF-amide like ligands. This receptor mediates its action by

CC association with G proteins that activate a phosphatidylinositol- 

CC calcium second messenger system. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC HIGHEST TO OREXIN, NPY AND CHOLECYSTOKININ RECEPTORS. 

CC -!- CAUTION: Ref.4 sequence differs from that shown due to erroneous 



CC gene model prediction. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF268898; AAG41397.1; -. 

DR EMBL; AB040104; BAB17677.1; -. 

DR EMBL; AF330055; AAK94199.1; -.

DR EMBL; AB065729; BAC05950.1; ALT_SEQ.

DR MIM; 607448; -.

DR GO; GO:0016021; C:integral to membrane; NAS.

DR GO; GO:0004930; F:G-protein coupled receptor activity; NAS.

DR GO; GO:0008188; F:neuropeptide receptor activity; NAS.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR InterPro; IPR005395; NPFF_receptor.

DR InterPro; IPR005396; NPFF_receptor1.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PRINTS; PR01570; NPFFRECEPTOR.

DR PRINTS; PR01571; NPFFRECEPTR1.



DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein.
FT DOMAIN 1 43 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 44 64 1 (POTENTIAL).
FT DOMAIN 65 80 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 81 101 2 (POTENTIAL).
FT DOMAIN 102 117 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 118 138 3 (POTENTIAL).
FT DOMAIN 139 158 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 159 179 4 (POTENTIAL).
FT DOMAIN 180 214 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 215 235 5 (POTENTIAL).
FT DOMAIN 236 271 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 272 292 6 (POTENTIAL).
FT DOMAIN 293 307 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 308 328 7 (POTENTIAL).
FT DOMAIN 329 430 CYTOPLASMIC (POTENTIAL).
FT DISULFID 116 203 BY SIMILARITY.
FT CARBOHYD 10 10 N-LINKED (GLCNAC...) (POTENTIAL)
FT CARBOHYD 18 18 N-LINKED (GLCNAC...) (POTENTIAL)
FT CARBOHYD 29 29 N-LINKED (GLCNAC...) (POTENTIAL)
FT CARBOHYD 113 113 N-LINKED (GLCNAC...) (POTENTIAL)
FT CARBOHYD 195 195 N-LINKED (GLCNAC...) (POTENTIAL)
FT CONFLICT 376 376 V -> A (IN REF. 3).
SQ SEQUENCE 430 AA; 47818 MW; BBB0146281B2B9A0 CRC64;


Query Match 23.5%; Score 522.5; DB 1; Length 430;



Best Local Similarity 33.6%; Pred. No. 9.2e-29; 

Matches 137; Conservative 56; Mismatches 150; Indels 65; Gaps 9; 
Qy 2 EPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYV 54
Db 4 EPSQ PPNSSWPLSQNGTNTEATPATNLTFSSY YQHTS PVAAMFI VAYA 51

Qy 55 AVFVVALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLF 114
Db 52 LIFLLCMVGNTLVCFIVLKNRHMHTVTNMFILNLAVSDLLVGIFCMPTTLVDNLITGWPF 111

Qy 115 GHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLA 174
Db 112 DNATCmSGLVQGMSVSASVFTLVAIAVERFRCIVHPFREKLTLRKALVTIAVIWALALL 171

Qy 175 IMVPQAAVM ECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGL 230
Db 172 IMCPSAWLTWREEHHFIWDARNRSYPLYSCWEAWPEKGMRRVYTTVLFSHIYLAPI^ 231

Qy 231 MAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQM 290
Db 232 IWMYARIARKLC-- QAPG PAPGGEE-AADPRAS 262

Qy 291 RARRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYAN 350
Db 263 RRRARVVHMLVMVALFFTLSWLPLWALLLLIDYGQLSAPQLHLVTVYA-FPFAHWLAFFN 321

Qy 351 SAANPIIYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHK 398
Db 322 S SAN PI IYGYFNEN FRRG FQAAFRARL CPRPSGSHK 357



RESULT 9 
NFF1_RAT 

ID NFF1_RAT STANDARD; PRT; 432 AA. 

AC Q9EP86; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Neuropeptide FF receptor 1 (RFamide-related peptide receptor OT7T022).

GN NPFF1.

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=Brain stem; 

RX MEDLINE=20482175; PubMed=11025660;

RA Hinuma S., Shintani Y., Fukusumi S., Iijima N., Matsumoto Y.,

RA Hosoya M., Fujii R., Watanabe T., Kikuchi K., Terao Y., Yano T.,

RA Yamamoto T., Kawamata Y., Habata Y., Asada M., Kitada C., Kurokawa T.,

RA Onda H., Nishimura O., Tanaka M., Ibata Y., Fujino M.;

RT "New neuropeptides containing carboxy-terminal RFamide and their 

RT receptor in mammals."; 

RL Nat. Cell Biol. 2:703-708(2000). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Sprague-Dawley; TISSUE=Hypothalamus;

RX MEDLINE=20564301; PubMed=11024015;

RA Bonini J.A., Jones K.A., Adham N., Forray C., Artymyshyn R.,

RA Durkin M.M., Smith K.E., Tamm J.A., Boteju L.W., Lakhlani P.P.,

RA Raddatz R., Yao W.-J., Ogozalek K.L., Boyle N., Kouranova E.V.,

RA Quan Y., Vaysse P.J., Wetzel J.M., Branchek T.A., Gerald C.,

RA Borowsky B.;

RT "Identification and characterization of two G protein-coupled

RT receptors for neuropeptide FF.";

RL J. Biol. Chem. 275:39324-39331(2000). 

CC -!- FUNCTION: Receptor for NPAF (A-18-F-amide) and NPFF (F-8-F-amide)
CC neuropeptides, also known as morphine-modulating peptides. Can

CC also be activated by a variety of naturally occurring or synthetic

CC FMRF-amide like ligands. This receptor mediates its action by 

CC association with G proteins that activate a phosphatidylinositol- 

CC calcium second messenger system. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Expressed at high levels in the hypothalamus. 

CC Moderate levels found in the midbrain, thalamus, medulla 

CC oblongata, testis, eye, whole brain, cerebral cortex, striatum, 

CC hippocampus, cerebellum, optic nerve, placenta, spinal cord, 

CC pituitary gland and ovary. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 
CC HIGHEST TO OREXIN, NPY AND CHOLECYSTOKININ RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AB040103; BAB17676.1; -. 

DR EMBL; AF268901; AAG41400.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR InterPro; IPR005395; NPFF_receptor.

DR InterPro; IPR005396; NPFF_receptor1.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PRINTS; PR01570; NPFFRECEPTOR.

DR PRINTS; PR01571; NPFFRECEPTR1.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein. 
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180 


214 


EXTRACELLULAR (POTENTIAL) 
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6 (POTENTIAL) . 


FT DOMAIN 295 309 EXTRACELLULAR (POTENTIAL)
FT TRANSMEM 310 330 7 (POTENTIAL).
FT DOMAIN 331 432 CYTOPLASMIC (POTENTIAL).
FT DISULFID 116 203 BY SIMILARITY.
FT CARBOHYD 10 10 N-LINKED (GLCNAC...) (POTENTIAL)




1 R 
-L O 


1 0 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT CARBOHYD 113 113 N-LINKED (GLCNAC...) (POTENTIAL)
FT CARBOHYD 195 195 N-LINKED (GLCNAC...) (POTENTIAL)
SQ SEQUENCE 432 AA; 48323 MW; 827325849444C408 CRC64;


Query Match 22.9%; Score 507.5; DB 1; Length 432;
Best Local Similarity 34.7%; Pred. No. 9.7e-28;
Matches 135; Conservative 60; Mismatches 143; Indels 51;



Qy 2 EPSATPGAQMGVPPGSREPSPVPPDYEDE--FLRYLWRDYLYPKQYEWVLIAAYVAVFVV 59

IN I I I : I I || | : : I I I I I : I : : 

Db 4 EPSQPPN GSWPLGQNGSDVETSMATSLTFSSY YQHSSPVAAMFIAAYVLIFLL 56 

Qy 60 ALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALC 119

= 1111111 I : I I II I I I I I I : I I : : : I : I | I : I : I : : : I I : I I 
Db 57 CMVGNTLVCFIVLKNRHMRTVTNMFILNLAVSDLLVGIFCMPTTLVDNLITGWPFDNATC 116 

Qy 120 KVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQ 179

Is : I : I I I : I I I I I : : I : III I hi : | I I I :: I I | I 

Db 117 KMSGLVQGMSVSASVFTLVAIAVERFRCIVHPFREKLTLRKALFTIAVIWALALLIMCPS 176 

Qy 180 AAVM ECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAY 235 

I : I : : I I : III: I : I : I I I I I I :: I 

Db 177 AVTLTWREEHHFMLDARNRSYPLYSCWEAWPEKGMRKVTTAV^ 236 

Qy 236 FQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRK 295

: I I I I I I I ||: | : | | : 

Db 237 VRIARKLC-- QAPGPAR DTEE AVAEGGRTSRRRAR 269

Qy 296 TAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANS 351

I I : : I I I I : I I : I I : I : : | : : : | | | | : | | | : : | 

Db 270 WHMLVMVALFFTLSWLPLWVLLLLID YGELSELQLHLLSVYA- FPLAHWLAFFHS 324

Qy 352 AANPIIYNFLSGKFREQFKAAF--SCCLP 378 

: I I I I I I : : II I : M I II 
Db 325 SANPIIYGYFNENFRRGFQAAFRAQLCWP 353 

RESULT 10 
NFF2_HUMAN



ID NFF2_HUMAN STANDARD; PRT; 522 AA. 

AC Q9Y5X5; Q96RV1; Q9NR49;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Neuropeptide FF receptor 2 (Neuropeptide G protein-coupled receptor) 

DE (G-protein-coupled receptor HLWAR77). 

GN GPR74 OR NPGPR OR NPFF2. 

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1), AND TISSUE SPECIFICITY. 

RC TISSUE=Fetal; 



RX MEDLINE=99180505; PubMed=10079187;

RA Cikos S., Gregor P., Koppel J. ; 

RT "Sequence and tissue distribution of a novel G-protein-coupled 

RT receptor expressed prominently in human placenta."; 

RL Biochem. Biophys. Res. Commun. 256:352-356(1999).

RN [2]

RP SEQUENCE FROM N.A. (ISOFORM 2), AND CHARACTERIZATION.

RC TISSUE=Brain;

RX MEDLINE=20408933; PubMed=10851242;

RA Elshourbagy N.A., Ames R.S., Fitzgerald L.R., Foley J. J., 

RA Chambers J.K., Szekeres P.G., Evans N.A., Schmidt D.B., Buckley P.T., 

RA Dytko G.M., Murdock P.R., Milligan G., Groarke D.A., Tan K.B., 

RA Shabon U., Nuthulaganti P., Wang D.Y., Wilson S., Bergsma D.J., 

RA Sarau H.M. ; 

RT "Receptor for the pain modulatory neuropeptides FF and AF is an orphan 

RT G protein-coupled receptor."; 

RL J. Biol. Chem. 275:25965-25971(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM 2), AND CHARACTERIZATION. 

RC TISSUE=Spinal cord; 

RX MEDLINE=20564301; PubMed=11024015;

RA Bonini J.A., Jones K.A., Adham N., Forray C., Artymyshyn R.,

RA Durkin M.M., Smith K.E., Tamm J.A., Boteju L.W., Lakhlani P.P.,

RA Raddatz R., Yao W.-J., Ogozalek K.L., Boyle N., Kouranova E.V.,

RA Quan Y., Vaysse P.J., Wetzel J.M., Branchek T.A., Gerald C.,

RA Borowsky B.;

RT "Identification and characterization of two G protein-coupled

RT receptors for neuropeptide FF.";

RL J. Biol. Chem. 275:39324-39331(2000). 

RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM 3). 

RX MEDLINE=20299143; PubMed=10837915;

RA Parker R.M.C., Copeland N.G., Eyre H.J., Liu M., Gilbert D.J.,

RA Crawford J., Couzens M., Sutherland G.R., Jenkins N.A., Herzog H.;

RT "Molecular cloning and characterisation of GPR74 a novel G-protein

RT coupled receptor closest related to the Y-receptor family.";

RL Brain Res. Mol. Brain Res. 77:199-208(2000).

RN [5] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RA Liu Q., Guan X.-M., McDonald T.P., Jiang Q., Zeng Z., Marlene J.,

RA Williams D.L. Jr., Hong Y., Figueroa D., Clements M.K., Mallee J.,

RA Wang R., Evans J., Gould R., Austin C.P.;

RT "Identification and characterization of two cognate receptors for

RT mammalian FMRFamide-like neuropeptides.";

RL Submitted (DEC-2000) to the EMBL/GenBank/DDBJ databases.

RN [6]

RP SEQUENCE FROM N.A. (ISOFORM 4). 

RA Laemmle B.S., Schindler M., Beilmann M., Hamilton B.S., Doods H.N.,

RA Wieland H.A.;

RT "Cloning and characterization of the NPGP receptor and identification

RT of a novel short mRNA isoform in human hypothalamus.";

RL Submitted (MAY-2001) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: Receptor for NPAF (A-18-F-amide) and NPFF (F-8-F-amide)

CC neuropeptides, also known as morphine-modulating peptides. Can

CC also be activated by a variety of naturally occurring or synthetic

CC FMRF-amide like ligands. This receptor mediates its action by

CC association with G proteins that activate a phosphatidylinositol- 



CC calcium second messenger system. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4; 

CC Comment=Experimental confirmation may be lacking for some 

CC isoforms; 

CC Name=1; Synonyms=long form;

CC IsoId=Q9Y5X5-1; Sequence=Displayed;

CC Name=2; Synonyms=short form;

CC IsoId=Q9Y5X5-2; Sequence=VSP_001907;

CC Name=3;

CC IsoId=Q9Y5X5-3; Sequence=VSP_001908, VSP_001909;

CC Name=4; 

CC IsoId=Q9Y5X5-4; Sequence=VSP_001910, VSP_001911; 

CC -!- TISSUE SPECIFICITY: Isoform 1 is abundant in placenta. Relatively 

CC highly expressed in thymus, testis, and small intestine. Expressed 

CC at low levels in several tissues including spleen, prostate, 

CC brain, heart, ovary, colon, kidney, lung, liver and pancreas and 

CC not expressed in skeletal muscle and leukocytes. Highest but 

CC relatively low level of isoform 2 in placenta and very low level 

CC in numerous tissues including adipose tissue and many brain 

CC regions. Isoform 3 is expressed in brain and heart and, at lower 

CC levels, in kidney, liver, lung and pancreas. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.

CC HIGHEST TO OREXIN, NPY AND CHOLECYSTOKININ RECEPTORS. 

CC -!- CAUTION: Ref.4 sequence differs from that shown due to a 
CC frameshift in position 503. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AF119815; AAD22047.1; -. 

DR EMBL; AF257210; AAF87078.1; -. 

DR EMBL; AF268899; AAG41398.1; 

DR EMBL; AF236083; AAK58513.1; ALT_FRAME.

DR EMBL; AF330053; AAK94197.1; 

DR EMBL; AJ311393; CAC85427.1; -. 

DR Genew; HGNC:4525; GPR74. 

DR MIM; 607449; -. 

DR GO; GO:0005887; C:integral to plasma membrane; TAS.

DR GO; GO:0004930; F:G-protein coupled receptor activity; IAS.

DR GO; GO:0008151; P:cell growth and/or maintenance; TAS.

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin...; TAS.

DR GO; GO:0009582; P:perception of abiotic stimulus; TAS.

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR InterPro; IPR005395; NPFF_receptor.

DR InterPro; IPR005397; NPFF_receptor2.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PRINTS; PR01570; NPFFRECEPTOR.

DR PRINTS; PR01572; NPFFRECEPTR2.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
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T7»rp 
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N-LINKED (GLCNAC. . .) (POTENTIAL). 
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Missing (in isoform 2) . 










FT /FTId=VSP_001907.
FT VARSPLIC 1 99 Missing (in isoform 3).


FT /FTId=VSP_001908.
FT VARSPLIC
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R -> M (in isoform 3) . 


FT /FTId=VSP_001909.
FT VARSPLIC
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FIMNEKWDTNSSENWHPIWNVNDTKHHLYSDI -> MAIWK 
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4 ) . 


FT /FTId=VSP_001910.
FT VARSPLIC 133
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Missing (in isoform 4). 


FT /FTId=VSP_001911.
FT CONFLICT 466 466 A -> T (IN REF. 1 AND 4).
SQ SEQUENCE 522 AA; 60270 MW; 40CB9FCD42F77041 CRC64;



Query Match 22.5%; Score 499.5; DB 1; Length 522;
Best Local Similarity 33.4%; Pred. No. 4.1e-27;
Matches 120; Conservative 59; Mismatches 131; Indels 49; Gaps 7;

Qy 27 YEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYFIV 86
II : I : I I : I : I : I : :: I I I : I I I I I I I I I I | I | :
Db 129 YSDINITYV-NYYLHQPQVAAI FI I S YFLI FFLCMMGNTWCFIVMRNKHMHTVTNLFI L 187

Qy 87 NLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWY 146
I I : : : I : I I I : I : I I : I I I I : : I | : : I : I I : : | II I I : I I :
Db 188 NLAISDLLVGIFCMPITLLDNIIAGWPFGNTMCKISGLVQGISVAASVFTLVAIAVDRFQ 247

Qy 147 AICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSV 201
: : I I I : I I : I I : : : I I | | : : | | | | | |
Db 248 CWYPFKPKLTIKTAFVIIMIIWVLAITIMSPSAVMLH VQEEKYYRVRLNSQNKTSP 304

Qy 202 CDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNW 258
III: I I I : I I I I I I I : : I : I | : : I I |
Db 305 VYWCREDWPNQEMRKIYTTVLFANIYLAPLSLIVIMYGRIGISLFRAAVPHTGRKNQEQW 364



Qy 259 KRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLN 318 

I I : : I I I I : : I I : I I : I I : I 

Db 365 Hws RKKQKIIKMLLIVALLFILSWLPLWTLM 396 

Qy 319 VLKRVFGMFRQASDREA VYACFTFSHWLVYANSAANPII YNFLSGKFREQFKAAF 373 

: I : I I ' I s I : I I I : I I : I I I I I I : II I : M 

Db 397 MLSD YADLSPNELQIINIY-IYPFAHWLAFGNSSVNPIIYGFFNENFRRGFQEAF 450 



RESULT 11 
NFF2_RAT 

ID NFF2_RAT STANDARD; PRT; 417 AA. 

AC Q9EQD2;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Neuropeptide FF receptor 2 (Neuropeptide G protein-coupled receptor) . 

GN NPGPR OR NPFF2. 

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Sprague-Dawley; 

RX MEDLINE=20564301; PubMed=11024015;

RA Bonini J.A., Jones K.A., Adham N., Forray C., Artymyshyn R.,

RA Durkin M.M., Smith K.E., Tamm J.A., Boteju L.W., Lakhlani P.P.,

RA Raddatz R., Yao W.-J., Ogozalek K.L., Boyle N., Kouranova E.V.,

RA Quan Y., Vaysse P.J., Wetzel J.M., Branchek T.A., Gerald C.,

RA Borowsky B.;

RT "Identification and characterization of two G protein-coupled

RT receptors for neuropeptide FF.";

RL J. Biol. Chem. 275:39324-39331(2000). 

CC -!- FUNCTION: Receptor for NPAF (A-18-F-amide) and NPFF (F-8-F-amide)
CC neuropeptides, also known as morphine-modulating peptides. Can

CC also be activated by a variety of naturally occurring or synthetic

CC FMRF-amide like ligands. This receptor mediates its action by

CC association with G proteins that activate a phosphatidylinositol- 

CC calcium second messenger system. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 
CC HIGHEST TO OREXIN, NPY AND CHOLECYSTOKININ RECEPTORS. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF268900; AAG41399.1; 

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR InterPro; IPR005395; NPFF_receptor.

DR InterPro; IPR005397; NPFF_receptor2.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PRINTS; PR01570; NPFFRECEPTOR.

DR PRINTS; PR01572; NPFFRECEPTR2.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein. 

FT DOMAIN 1 45 EXTRACELLULAR (POTENTIAL). 

FT TRANSMEM 46 66 1 (POTENTIAL).

FT DOMAIN 67 82 CYTOPLASMIC (POTENTIAL). 

FT TRANSMEM 83 103 2 (POTENTIAL). 

FT DOMAIN 104 119 EXTRACELLULAR (POTENTIAL) . 

FT TRANSMEM 120 140 3 (POTENTIAL) . 

FT DOMAIN 141 160 CYTOPLASMIC (POTENTIAL) . 

FT TRANSMEM 161 181 4 (POTENTIAL). 

FT DOMAIN 182 217 EXTRACELLULAR (POTENTIAL). 

FT TRANSMEM 218 238 5 (POTENTIAL) . 

FT DOMAIN 239 274 CYTOPLASMIC (POTENTIAL) . 

FT TRANSMEM 275 295 6 (POTENTIAL). 

FT DOMAIN 296 310 EXTRACELLULAR (POTENTIAL). 

FT TRANSMEM 311 331 7 (POTENTIAL). 

FT DOMAIN 332 417 CYTOPLASMIC (POTENTIAL). 

FT DISULFID 118 206 BY SIMILARITY. 

FT CARBOHYD 8 8 N-LINKED (GLCNAC. . .) (POTENTIAL) 

FT CARBOHYD 20 20 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 31 31 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 198 198 N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ SEQUENCE 417 AA; 47710 MW; DFEB6AC63AF2AAD6 CRC64;

Query Match 22.5%; Score 499; DB 1; Length 417; 

Best Local Similarity 31.4%; Pred. No. 3.6e-27; 

Matches 118; Conservative 61; Mismatches 135; Indels 62; Gaps 

Qy 25 PDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCLAVWRNHHMRTVTNYF
1 1 I : I : I I : I I : : I : I : : I I I I : I I I I I : I I I I I : |
Db 25 PWYSDINITYM-NYYLHQPHVTAVFISSYFLIFFLCMVGNTWCFWIRNRYMHTVTNFI

Qy 85 IVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDR
I I I : : : I : II I : I : I I : | I | | : : | | : : I : | | : : | | | | | : | |
Db 84 IFNLAISDLLVGIFCMPITLLDNIIAGWPFGSSMCKISGLVQGISVAASVFTLVAIAVDF

Qy 145 WYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFS
: : ; l I I : I I: I I ::: I I I | :: | | [ || |
Db 144 FRCWYPFKPKLTVKTAFVMIVIIWGLAITIMTPSAIMLH--VQEEKYYRVRLSSHNKT

Qy 201 VCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVR
Ml: : I I : I I I I I I I : : | : | | : | 2
Db 201

Qy 257
• : • I I I I : I I : I I : I I :
Db 261 -HVS KKKQKVI KMLLTVALLFI LSWLPLWT 291

Qy 317 LNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAANPIIYNFLSGKFR 366
1:1 1 1 : : : I : I I I : I I : | | | | | | : | |
Db 292 LMMLSDYADLSPNKLRVINIY VYPFAHWLAFCNS SVNPI I YGFFNENFR 340



Qy 367 EQFKAAFSCCLPGLGP 382 

I : II I : I 
Db 341 SGFQDAFQFCQKKVKP 356 



RESULT 12 
CCKR_RABIT

ID CCKR_RABIT STANDARD; PRT; 427 AA.
AC O97772;
DT 16-OCT-2001 (Rel. 40, Created)
DT 16-OCT-2001 (Rel. 40, Last sequence update)
DT 10-OCT-2003 (Rel. 42, Last annotation update)
DE Cholecystokinin type A receptor (CCK-A receptor) (CCK-AR).
GN CCKAR.
OS Oryctolagus cuniculus (Rabbit).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.
OX NCBI_TaxID=9986;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=95002144; PubMed=7918628;
RA Reuben M., Rising L., Prinz C., Hersey S., Sachs G.;
RT "Cloning and expression of the rabbit gastric CCK-A receptor.";
RL Biochim. Biophys. Acta 1219:321-327(1994).
CC -!- FUNCTION: Receptor for cholecystokinin. Has a 1000 fold affinity
CC for CCK rather than for gastrin. It modulates feeding and
CC dopamine-induced behavior in the central and peripheral nervous
CC system. This receptor mediates its action by association with G
CC proteins that activate a phosphatidylinositol-calcium second
CC messenger system (By similarity).
CC -!- SUBCELLULAR LOCATION: Integral membrane protein.
CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; U86601; AAD11547.1; -.
DR HSSP; P02699; 1F88.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein;
KW Lipoprotein; Palmitate.
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1 (POTENTIAL) . 
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CYTOPLASMIC (POTENTIAL) . 
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104 


2 (POTENTIAL) . 
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115 


EXTRACELLULAR (POTENTIAL) . 
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3 (POTENTIAL) . 
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138 
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CYTOPLASMIC (POTENTIAL) . 
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24 
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FT 
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1 Qfl 
X zj U 


1 q n 
i y u 


FT 


DISULFID 


114 


196 


FT 


LIPID 


386 


386 


SQ 


SEQUENCE 


427 AA; 


473 


Query Match 




21 



4 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

5 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

6 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

7 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
BY SIMILARITY. 

S-palmitoyl cysteine (By similarity) . 
MW; 089FD10E2B86DB25 CRC64 ; 

; Score 466.5; DB 1; Length 427;
Best Local Similarity 29.6%; Pred. No. 6e-25; 

Matches 128; Conservative 86; Mismatches 167; Indels 51; Gaps 13; 

Qy 8 GAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQYEW VLIAAYVAVFVVALVGN 64

I 1 = 11 I ::| I | : | || I I I : | : : : : : | | 

Db 9 GNASGIPP PCELGLDNETLFCLDQP PPSKEWQPAVQILLYSLIFLLSVLGN 59 

Qy 65 TLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALCKVIPY 124

HI : M I I I I I I I : :: I : : : | : : : I : | : | :::::: | | | | | | | 
Db 60 TLV ITVLIRNKRMRTWNIFLLSLAISDLMLCLFCMPFNLIPNLLKDFIFGSALCKTTTY 119 

Qy 125 LQAVSVSVAVLTLSFIALDRWYAICHPLLFK--STARRARGSILGIWAVSLAIMVPQAAV 182

I MM: I I |:|:|: ||| || : | | | | :| ||| | 
Db 120 LMGTSVSVSTLNLVAISLERYGAICKPLQSRVWQTKSHALKVIAATWCLSFAIMTPYPIY 179

Qy 183 MECSSVLPELANRTRLFSVCDERWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKL 242

I : : • I : : : I I : : : I : : : : | | : | : | | | : | 

Db 180 — SN LVPFTKTNNQTANMCRFLLPSDVMQQAWHTFLLLILFLIPGIVMMVAYGMISLEL 236 

Qy 243 W-GRQIPGTTSALVRNWK RPSDQLGDLEQ— GLSGEPQPRA 280 

: I : : : I : I : M : I : I | | | 

Db 237 YQGIKFDASQKKSAKERKASTGSGRFEDNDGCYLQRSKPTRQL-ELQQLSGGGGGRVSRI 295 

Qy 281 RAFLAEVKQMRARRKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACF 340

I : : : I : : : : II I I : : : : | | | : : | | | : s . - . | 
Db 296 RS-SSSAATLMAKKRVI RMLMVIWLFFLCWMPIFSANAWRAYDTV SAERRLSGTPI 351

Qy 341 TFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSL 400

: I I I : I I I II I I : : : I I I I I I I I II I I I : I : 

Db 352 SFILLLSYTSSCVNPIIYCFMNRRFRLGFMATFPCC-PNPGP PGPRAEAGEEEE 404

Qy 401 SLQSRCSVSKIS 412 

: I I : I : I 
Db 405 GRTTRASLSRYS 416



RESULT 13 
NY2R_PIG 

ID NY2R_PIG STANDARD; PRT; 382 AA.

AC O02836; Q9TSI1;

DT 15-JUL-1998 (Rel. 36, Created) 



DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Neuropeptide Y receptor type 2 (NPY2-R) (NPY-Y2 receptor) . 

GN NPY2R. 

OS Sus scrofa (Pig).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.

OX NCBI_TaxID=9823;
RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Spleen; 

RX MEDLINE=99017380; PubMed=9802394;

RA Malmstroem R.E., Hoekfelt T., Bjoerkman J.-A., Nihlen C., Bystroem M.,

RA Ekstrand A.J., Lundberg J.M.;

RT "Characterization and molecular cloning of vascular neuropeptide Y 

RT receptor subtypes in pig and dog."; 

RL Regul. Pept. 75:55-70(1998).

RN [2]

RP REVISION TO 207.

RA Ekstrand A.J.;

RL Submitted (AUG-2000) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21184969; PubMed=11287083;

RA Larhammar D., Wraith A., Berglund M.M., Holmberg S.K., Lundell I.;

RT "Origins of the many NPY-family receptors in mammals.";

RL Peptides 22:295-307(2001).

CC -!- FUNCTION: Receptor for neuropeptide Y and peptide YY. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 
CC Highest to tachykinins receptors.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF005780; AAC26670.2; -. 

DR EMBL; AF106082; AAD13777.1; ALT_INIT. 

DR HSSP; P02699; 1F88. 

DR InterPro; IPR000276; GPCR_Rhodpsn.

DR Pfam; PF00001; 7tm_1; 1.

DR PRINTS; PR00237; GPCRRHODOPSN.

DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.

DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.

KW G-protein coupled receptor; Transmembrane; Glycoprotein; 

KW Phosphorylation; Lipoprotein; Palmitate. 

FT DOMAIN 1 50 EXTRACELLULAR (POTENTIAL) 

FT TRANSMEM 51 73 1 (POTENTIAL). 

FT DOMAIN 74 83 CYTOPLASMIC (POTENTIAL) 

FT TRANSMEM 84 105 2 (POTENTIAL). 

FT DOMAIN 106 125 EXTRACELLULAR (POTENTIAL) 

FT TRANSMEM 126 147 3 (POTENTIAL). 

FT DOMAIN 148 167 CYTOPLASMIC (POTENTIAL). 



FT 


TRANSMEM 
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^1 J- -J 
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TRANSMEM 
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^ l l Urij/\oIYL-LU irUlrjNIlAL) . 


FT 


TRANSMEM 
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1 1 


1 1 
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N-LINKED (GLCNAC. . .) (POTENTIAL 


FT 


DISULFID 


124 


9 f)d 


ox blMl LiAKl I Y . 


FT LIPID 343 343 S-palmitoyl cysteine (Potential).
FT CONFLICT 4 4 I -> L (IN REF. 3).
FT CONFLICT 179 179 L -> V (IN REF. 3).
FT CONFLICT 215 215 I -> N (IN REF. 3).
FT CONFLICT 366 366 A -> V (IN REF. 3).
SQ SEQUENCE 382 AA; 42556 MW; 5C01FAF3A0423858 CRC64;


Query Match 20.3%; Score 451; DB 1; Length 382;



j f - - — ^ . . v . j.^ r 

Matches 118; Conservative 75; Mismatches 141; Indels 78; Gaps 12; 



Qy 1 MEPSATPGAQMGVPPGSREP-SPVPPDYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVV 59
I I I I II I : I I I | : | : : : | | | | :: ::
Db 19 MEPSG PGHTTPRGELAPDSEPEL KDSTKLIEVQIILILAYCSIILL 64

Qy 60 ALVGNTLVCLAVWRNHHMRTVTNYFIVNLSLADVLVTAICLPASLLVDITESWLFGHALC 119
MII:M I : Mill hi I I |::| |:| | :||| :| : | | ||
Db 65 GWGNSLVIHWIKFKSMRTVTNFFIANLAVADLLVNTLCLPFTLTYTLMGEWKMGPVLC 124

Qy 120 KVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARRARGSILGI-WAVSLAIMVP 178
: : I I I : : I I : : I I : I I I M I : I I : : | | : | : | : | ; |
Db 125 HLVPYAQGLAVQVSTITLTVIALDRHRCIVYHLESK-ISKRISFLIIGLAWGISALLASP 183

Qy 179 QAAVMECS SVLPELANRTRLFSVCDERWADD LYPKIYHSCFFIVTYLAPLGLMAM 233
I II : : I : I I : I : : I : | : : I : I I I :::
Db 184 LAI FREYSLIEIIPDFE IVACTEKWPGEEKSIYGTVYSLSSLLILYVLPLGII SF 238

Qy 234 AYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEPQPRARAFLAEVKQMRAR 293
: I : I : I I II | : : | ,
Db 239 SYARIWSKLKNHVSPG GVNDHYHQR R 264

Qy 294 RKTAKMLMVVLLVFALCYLPISVLNVLKRVFGMFRQASDREAVYACFTFSHWLVYANSAA 353
: I I I I I : I : : I I I : : | | | : : : | | : | | | . :: ,
Db 265 QKTTKMLVCVVWFAVSWLP LHAFQLAVDIDSQVLDLKEYKLIFTVFHIIAMCSTFA 321

Qy 354 NPIIYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPRSSASHKSLSLQSR 405
I I : : I : : : : | : | : | | | | | | : | : | :
Db 322 NPLLYGWMNSNYRKAFLSAFRC EQRLDAIHSEVSMTSK 359



RESULT 14 
NYR_DROME 

ID NYR_DROME STANDARD; PRT; 449 AA.
AC P25931;
DT 01-MAY-1992 (Rel. 22, Created)
DT 01-MAY-1992 (Rel. 22, Last sequence update)
DT 01-AUG-1992 (Rel. 23, Last annotation update)
DE Neuropeptide Y receptor (NPY-R) (PR4 receptor).
GN NEPYR.
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
OC Ephydroidea; Drosophilidae; Drosophila.
OX NCBI_TaxID=7227;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=92112730; PubMed=1370455;
RA Li X.-J., Wu Y.-N., North R.A., Forte M.;
RT "Cloning, functional expression, and developmental regulation of a
RT neuropeptide Y receptor from Drosophila melanogaster.";
RL J. Biol. Chem. 267:9-12(1992).

CC -!- FUNCTION: Receptor for neuropeptide Y and peptide YY.
CC -!- SUBCELLULAR LOCATION: Integral membrane protein.
CC -!- DEVELOPMENTAL STAGE: At low levels during early embryonic stages,
CC its expression increases later and reaches the highest level
CC during late stages of embryogenesis. Subsequently, PR4 levels are
CC reduced during larval stages and increase during pupal stages.
CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.
CC Highest to tachykinins receptors.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; M81490; AAA28727.1; -.
DR PIR; A41738; A41738.
DR FlyBase; FBgn0004842; NepYr.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR Pfam; PF00001; 7tm_1; 1.
DR PRINTS; PR00237; GPCRRHODOPSN.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; 1.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane; Glycoprotein.

FT DOMAIN 1 85 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 86 112 1 (POTENTIAL).
FT DOMAIN ? ? CYTOPLASMIC (POTENTIAL).
FT TRANSMEM ? ? 2 (POTENTIAL).
FT DOMAIN ? ? EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 166 187 3 (POTENTIAL).
FT DOMAIN 188 203 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 204 232 4 (POTENTIAL).
FT DOMAIN 233 256 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 257 282 5 (POTENTIAL).
FT DOMAIN 283 309 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 310 330 6 (POTENTIAL).
FT DOMAIN 331 347 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 348 372 7 (POTENTIAL).
FT DOMAIN 373 449 CYTOPLASMIC (POTENTIAL).
FT CARBOHYD 34 34 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 64 64 N-LINKED (GLCNAC...) (POTENTIAL).
FT CARBOHYD 70 70 N-LINKED (GLCNAC...) (POTENTIAL).

SQ SEQUENCE 449 AA; 51885 MW; 1ADB32EFD50146C3 CRC64;



Query Match 20.2%; Score 447.5; DB 1; Length 449; 

Best Local Similarity 29.4%; Pred. No. 1.2e-23; 

Matches 109; Conservative 63; Mismatches 138; Indels 61; Gaps 9;

Qy 26 DYEDEFLRYLWRDYLYPKQYEWVLIAAYVAVFVVALVGNTLVCIAWRNHHMRTVTNYFI 85
 I I; II :: :: I : : I : I I : I I | | | : I Ml MM I
Db 75 °YD LLSEDMWSSAYFKIIVYMLYIPIFIFALIGNGTVCYIVYSTPRMRTVTNYFI 129

Qy 86 VNLSLADVLVTAICLPASLL-VDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDR 144
 : ' : : I : I : : I I : I : : I I II Ml : I I I M I I : II I : : | |
Db 130 ASLAIGDILMSFFCEPSSFISLFILNYWPFGLALCHFWYSQAVSVLVSAYTLVAISIDR 189

Qy 145 WYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELANRTRLFSVCDE 204
 : M I I ' I I I I I : I : : I I : I I : : : : | |
Db 190 YIAIMWPLKPRITKRYATFIIAGVWFIALATALPIPIVSGLDIPMSPWHTKCEKY-ICRE 248

Qy 205 RWADDLYPKIYHSCFFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQ 264
 I I I : :: Ml:: I : I : : I : : M : | |
Db 249 MWP S RS QE Y Y YT L S L FALQ FWP LGVL I FT YARI T I RVWAKRP P GEA ETNRDQ 301

Qy 265 LGDLEQGLSGEPQPRARAFLAEVKQMRARRKTAKMLMWLLVFALCYLPISVLNVLKRVF 324
 : I • = I I M : : I : : M I : M : : I : I
Db 302 RMARSKRKMVKMMLTWIVFTCCWLPFNILQLL 334

Qy 325 GMFRQAS DREAV YACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFS 374
 Ml I I I I M : : M I M : : : : M I
Db 335 LNDEEFAHWDPLPYVTVFAF-HWLAMSHCCYNPIIYCYMNARFRSGFVQLMHRMPG 388

Qy 375 CCLPGLG 381
 Ml : I
Db 389 LRRWCCLRSVG 399



RESULT 15 
OX1R_PIG
ID OX1R_PIG STANDARD; PRT; 86 AA.
AC O97661;
DT 30-MAY-2000 (Rel. 39, Created)
DT 30-MAY-2000 (Rel. 39, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE Orexin receptor type 1 (Ox1r) (Hypocretin receptor type 1) (Fragment).
GN HCRTR1.
OS Sus scrofa (Pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.
OX NCBI_TaxID=9823;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Hypothalamus;
RA Matteri R.L., Dyer C.J.;
RT "Partial cDNA sequence of the porcine type 1 orexin receptor.";
RL Submitted (OCT-1998) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: Moderately selective excitatory receptor for orexin-A
CC and, with a lower affinity, for orexin-B neuropeptide. Seems to be
CC exclusively coupled to the G(q) subclass of heteromeric G
CC proteins, which activates the phospholipase C mediated signaling
CC cascade (By similarity).
CC -!- SUBCELLULAR LOCATION: Integral membrane protein.
CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AF097995; AAC68703.1; -.
DR InterPro; IPR000276; GPCR_Rhodpsn.
DR PROSITE; PS00237; G_PROTEIN_RECEP_F1_1; PARTIAL.
DR PROSITE; PS50262; G_PROTEIN_RECEP_F1_2; 1.
KW G-protein coupled receptor; Transmembrane.
FT NON_TER 1 1
FT TRANSMEM <1 ? 3 (POTENTIAL).
FT DOMAIN 9 30 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 31 51 4 (POTENTIAL).
FT DOMAIN 52 82 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 83 >86 5 (POTENTIAL).
FT NON_TER 86 86
SQ SEQUENCE 86 AA; 9853 MW; C34177B3BC8590E8 CRC64;

Query Match 19.7%; Score 437; DB 1; Length 86;
Best Local Similarity 96.5%; Pred. No. 1.4e-23;
Matches 83; Conservative 1; Mismatches 2; Indels 0; Gaps 0;



Qy 135 LTLSFIALDRWYAICHPLLFKSTARRARGSILGIWAVSLAIMVPQAAVMECSSVLPELAN 194
       |||||||||||||| ||||||||||||||||||||||| |:|||||||||||||||||||
Db   1 LTLSFIALDRWYAIYHPLLFKSTARRARGSILGIWAVSPAVMVPQAAVMECSSVLPELAN 60

Qy 195 RTRLFSVCDERWADDLYPKIYHSCFF 220
       ||||||||||||||||||||||||||
Db  61 RTRLFSVCDERWADDLYPKIYHSCFF 86



Search completed: September 28, 2004, 09:50:25
Job time : 15 secs
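
Note on the per-hit statistics: in the hits above that print a complete statistics block, the "Best Local Similarity" figure agrees with Matches divided by the total number of aligned columns (Matches + Conservative + Mismatches + Indels). The short sketch below checks that relationship against the printed values for RESULT 14 and RESULT 15. The formula is inferred from the figures in this listing, not taken from the search program's documentation, and the function name is hypothetical.

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of aligned columns that are exact matches.

    Assumption: the report's "Best Local Similarity" is Matches divided by
    the sum of all four column counts. This is inferred from the printed
    numbers, not from a documented definition.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


# RESULT 14 (NYR_DROME): report prints 29.4%
print(best_local_similarity(109, 63, 138, 61))

# RESULT 15 (OX1R_PIG): report prints 96.5%
print(best_local_similarity(83, 1, 2, 0))
```

Both calls reproduce the percentages printed in the corresponding statistics blocks, which supports the assumed definition for these two hits only.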



